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"Synthetic Polynucleotides" 

TECHNICAL FIELD 

The present invention relates to synthetic 
polynucleotides encoding proteins and polypeptides 
5 selected from the group consisting of lysyl oxidases, 

lysyl oxidase- like proteins and variants thereof; to use 
of the synthetic polynucleotides to provide high yields 
of those proteins and polypeptides; and to uses of the 
proteins and polypeptides produced from the synthetic 
10 polynucleotides. 

BACKGROUND ART 

Lysyl oxidase is a copper dependent amine oxidase 
that catalyses the oxidation of amines, including but not 
limited to primary amines and, in particular, the amine 
15 side chain of lysine. Lysine oxidation catalysed by 

lysyl oxidase has been observed in the oligopeptide and 
polypeptide chains of collagen and tropoelastin. Lysyl 
oxidase activity has been observed with other amine 
containing substrates, such as oligopeptides where the 

2 0 efficiency of catalyzed oxidation is dependent on 

adjacent sequences (Kagan et al . , 1995a) such as vicinal 
dicarboxylic amino acid residues (Nagan and Kagan, 1994) . 

With varying efficiencies, lysyl oxidases can 
oxidise other substrates, such as butylamine and p- 
25 hydroxybenzyl amine, for example, to form butyraldehyde 
and p-hydroxybenzylaldehyde respectively. Non-peptide 
reactivity is also displayed with, for example, semi- 
carbazide(s) and oxidation of tyramine (Palcic et al., 
1995) . Lysyl oxidase activity has also been observed 

3 0 with other amine containing molecules including 

inhibitors and unnatural substrates such as trans-2- 
phenylcyclopropylamine (Shah et al., 1993). 

The majority of the work done to date in 
characterising lysyl oxidase has been with respect to 
35 non-human mammalian lysyl oxidases. The level of amino 
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acid homology between lysyl oxidases from different 
species is of the order of 90%. 

"Lysyl oxidase- like proteins" have also been 
identified by the analysis of nucleotide and predicted 
5 amino acid sequence alignments of DNA and protein 

molecules which are expressed in a particular mammalian 
species including humans. The level of homology between 
these proteins and lysyl oxidases is of the order of 75%, 
which is a highly significant degree of homology. 

10 Evidence indicates that these molecules may function as 
enzymes in the extracelluar space as members of a lysyl 
oxidase family of molecules, and in the cell to provide 
lysyl oxidase activities (Kim, Y. et al. 1995). 

The lysyl oxidase gene encodes a single polypeptide 

15 species and there has been no observation of lysyl 

oxidase mRNA splice variants or isoforms of the lysyl 
oxidase polypeptide in vivo (Boyd et al., 1995). 

In the production of intra- and inter- molecular 
crosslinked molecules, lysine oxidation in tropoelastin 

20 is a necessary step in the formation of allysine, 
desmosine and isodesmosine condensation products. 

Crosslinked molecules, including elastin and 
collagen, are significant components of fully functional 
connective tissue. In this regard, deficiencies in lysyl 

25 oxidase such as that found in lathyrism, lead to marked 
phenotypic changes that can compromise the viability of 
an individual . 

Lysyl oxidase exists in at least two forms: an 
intracellular protein and the more thoroughly 

30 characterised extracellular form. In the intracellular 
form its roles include a ras recission function, and the 
encoding gene has been classified in this context as a 
ras-recission gene or rrg (Contente et al . , 1990; Kenyon 
et al., 1991). In its capacity as rrg, its expression is 

35 altered in an incompletely catalogued manner, to 

apparently reduce the oncogenic phenotype of cells 
expressing aberrant ras. Lysyl oxidase levels also 
change during differentiation and development 
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(Dimaculangan et al., 1994) and in response to growth 
factors (Green et al, 1995) . Lysyl oxidase is also a 
secreted protein, available in the extracellular matrix 
of some connective tissues in very low concentrations. 
5 Difficulties in obtaining sufficient quantities of 

enzyme for biochemical analysis have impeded detailed 
exploration of its properties. Yields of purified 
naturally occurring lysyl oxidase available from typical 
purification procedures have been limited to 2-4mg 

10 starting with 0.5 to 1kg of cleaned bovine aortae (Kagan 
and Cai, 1995) . 

Extracellular lysyl oxidase is typically made as a 
larger protein, which includes a collection of amino acid 
residues at the amino terminus of the protein. This 

15 form, termed preprolysyl oxidase is secreted from the 
cell in a process contemporaneous with cleavage of the 
amino- terminal region to generate prolysyl oxidase which 
in turn is cleaved outside the cell to generate the 
mature form of the protein. 

2 0 The extracellular (and possibly intracellular) lysyl 

oxidase enzyme additionally contains copper and at least 
one organic cof actor which is postulated to be a quinone - 
like component. The organic cof actor has variously 
been considered to be covalently or non- covalently bound 

25 to the enzyme (Williamson et al . , 1986; Kagan and 

Trackman, 1991) . Examples of organic cofactors, which 
have been found to be associated with oxidases, include 
quinones such as P.Q.Q., topa quinone (2,4,5 
trihydroxyphenylalanine quinone) and tryptophan 

30 tryptophylquinone (Tanizawa, 1995) . The organic cof actor 
for lysyl oxidase is now known to be a quinone. It is 
possible that the cofactor can be supplied, generated by 
interaction of a quinone derivative with the depleted 
apo-form of the enzyme, and/or generated by protein 

35 oxidation such as that mediated by the participation of 
copper. There is at least one atom of copper tightly 
bound per one molecule of functional enzyme. However it 
was experimentally demonstrated (Gacheru et al . , 1990) 
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that when copper is removed by a chelating agent the 
inactive apoenzyme can be restored to its former level of 
activity by the addition of copper. 

Posttranslational modif ication(s) such as 
5 glycosylation have not been described for the mature form 
of lysyl oxidase, although there is evidence for 
glycosylation of the amino terminal region destined for 
removal during maturation of the precursor prolysyl 
oxidase (Cronshaw et al., 1995). As other forms of 

10 modification exist in vertebrate cells, and include 
participation by the cytosol , Golgi and secretion 
machinery, followed by extracellular processing, 
uncertainty has surrounded the question of whether a 
functional lysyl oxidase could be made in host cells 

15 other than verterbrate systems. 

Isolation of endogenous lysyl oxidase from, for 
example, mammalian tissue typically uses chemical agents 
which interfere with protein association, such as urea. 
Modest amounts of lysyl oxidase can be recovered in this 

20 way from skin (Shackleton and Hulmes, 1990) . The 

purified material is typically maintained in solutions 
containing chemical agent (s) , in part to minimise protein 
aggregation and loss of catalytic function. When 
prepared in this way, the enzyme displays sluggish 

25 activity, which has led to the assertion that the enzyme 
(or a macromolecular complex) (Cronshaw et al., 1993) is 
altered during the relatively harsh extraction procedure 
(Shackleton and Hulmes, 1990) . 

To circumvent difficulties associated with the use 

30 of the purified naturally occurring enzyme, recombinant 

production is a logical alternative. However despite the 
fact that inferred lysyl oxidase sequences including 
human (Mariani et al . , 1992; Svinarich et al . , 1992; 
Kenyon et al., 1993), rat (Trackman et al., 1990) and 

35 mouse (Contente et al . , 1993) have been available for 
several years there have been no reports in the 
literature of production of recombinant functional lysyl 
oxidase using non -mammalian hosts for other than modest 
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levels of expression and no functional expression of the 
human sequence. Kagan et al. (1995b), have reported the 
production of modest levels of porcine lysyl oxidase from 
cDNA in mammalian cells indicating that bacterial 
5 production of functional enzyme was problematic. One of 
the impediments to the production of recombinant lysyl 
oxidase has been the discrepancies appearing in some of 
the available sequence information (e.g. Trackman et al., 
1990; Hamalainen et al . , 1991 vs Mariani et al . , 1992). 

10 DESCRIPTION OF THE INVENTION 

The paucity of information in the literature in 
relation to the nature of host- specif ic post- 
translational modifications of lysyl oxidase (e.g. 
Cronshaw et al. 1995), together with the real absence of 

15 reports of functional lysyl oxidase production in non- 

mammalian recombinant expression systems, teaches against 
the choice of non-mammalian hosts for expression of 
nucleotide sequences encoding mammalian lysyl oxidase and 
implies that non-mammalian host cells are unlikely to 

20 produce functional lysyl oxidase which is encoded by a 
mammalian nucleotide sequence. 

The present inventor has reasoned that expression of 
mammalian lysyl oxidase in non-mammalian host cells might 
be achieved using a synthetic polynucleotide which 

25 encodes lysyl oxidase but which has been specifically 

designed for use in non-mammalian host cells such as (but 
not limited to) Escherichia coli, Saccharomyces 
cerevisiae and Pichia. In particular, the inventor 
proposed the hypothesis that non-mammalian host cells 

3 0 have difficulty expressing from nucleotide sequences 

encoding mammalian lysyl oxidase because of the presence 
of 'rare' codons in those sequences which are used 
infrequently in the non-mammalian host cell. Rare codons 
are codons that are found at low frequency in highly 

3 5 expressed genes in a particular host. A measure of 

efficiencies of codon usage is the codon adaptation index 
(CAI) (Sharp and Li 1987) . Highly expressed genes such 
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as those encoding ribosomal proteins in E. coll typically 
have high CAI values that are greater than or equal to 
0.45, whilst poorly expressed genes have much smaller CAI 
values; for example, the E . coll lad gene has a CAI of 
5 0.3 

The inventor calculated the CAI of a nucleotide 
sequence encoding a secreted form of human lysyl oxidase 
for a number of non -mammalian hosts by considering the 
usage of codons in the nucleotide sequence shown in 
10 Figure 1. The CAI is calculated to be 0.24 for E. coll 
and 0.11 for Saccharomyces cerevlslae. The inventor 
identified codons in the nucleotide sequence encoding 
human lysyl oxidase which are infrequently used in host 
cells such as E. coll including: an AGGAGA cluster 
encoding adjacent arginine residues at positions 937 to 
942 which has the capacity to limit expression and 
generate translational pausing and f rameshif ting in E. 
coll; ii other instances of single rare AGG and AGA 
arginine codons capable of instigating translational 
hopping (Kane 1992); and an ensemble of codons rarely 
found in highly expressed E. coll genes (Table l) . 
Together these codons account for 28% of the amino acid 
coding capacity of the sequence. 



15 



20 
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Table 1 






Codons found in the natural 


lysyl 


oxidase sequence of 


Figure 1 which are rarely found in highly expressed 


E. coli genes 










Codon 


Amino 


Frequency 


Occurrence 


in Figure 




acid 


in highly 


1 residues 








expressed 


{% total sequence) 






o a a 

ycnea 


s- 

0 






AGG 


Arg 


0.14 




6 


2.4 


AGA 


Arg 


0.21 




7 


2 n 


CGA 


Arg 


0.31 




2 


0.8 




Pro 


0 .43 




7 


2.8 


TGT 


Cys 


0.47 




7 


2.8 


TGC 


Cys 


0.61 




3 


1.2 


ACA 


Thr 


0.65 




7 


2.8 


CCT 


Pro 


0.66 




6 


2.4 


TCA 


Ser 


0.68 




3 


1.2 


GGA 


Gly 


0.70 




4 


1.6 


AGT 


Ser 


0.72 




6 


2.4 


CCA 


Pro 


0.82 




2 


0.8 


TCC 


Ser 


0.94 




4 


1.2 


GGG 


Gly 


0.97 




2 


0.8 


CTG 


Leu 


0.99 




6 


2.4 



In consideration of the CAI values for the 
nucleotide sequence encoding human lysyl oxidase in E. 
coli and Saccharomyces cerevisiae, the inventor reasoned 
that the problem associated with expression of 

25 verterbrate lysyl oxidase or lysyl oxidase -like molecules 
in particular host cells or expression systems was 
directly related to the high proportion of codons in the 
verterbrate lysyl oxidase or lysyl oxidase- like protein 
nucleotide sequence which are rarely used by the 

30 particular host cells. The inventor further reasoned 
that verterbrate lysyl oxidase and lysyl oxidase- like 
molecules might be expressed in alternative host cells if 
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the nucleotide sequences encoding those molecules were 
able to be specifically designed so as to favour higher 
levels of expression of lysyl oxidase in a particular 
alternative host cell . The inventor reasoned that 
5 nucleotide sequences which encode verterbrate lysyl 
oxidase or lysyl oxidase- like proteins and which have 
been specifically designed for high level expression in a 
particular host may also be suitable for expression in 
other host cells provided that the specifically designed 

10 nucleotide sequence contained codons which were 

frequently used by the other host cells. Moreover, the 
inventor realised that the concept of modifying the 
nucleotide sequence would find application in producing 
high level expression of lysyl oxidase or lysyl oxidase - 

15 like proteins in the natural host cell, in particular, 
high level expression of nucleotide sequences encoding 
human lysyl oxidase and lysyl oxidase- like proteins in 
human cells. 

The present invention provides high level expression 

20 of nucleotide sequences encoding lysyl oxidases and lysyl 
oxidase- like proteins in recombinant expression systems. 
The inventor has recognised that lysyl oxidases, lysyl 
oxidase- like proteins and variants thereof can be used in 
a variety of, for instance, pharmaceutical applications, 

25 but these uses require significant quantities of lysyl 

oxidase, lysyl oxidase- like proteins or variants thereof 
in a native conformation. Thus the realisation of the 
potential application of lysyl oxidase, lysyl oxidase- 
like expression products and variants thereof first 

30 requires a system which allows high level production of 
those products in native conformation. 

The benefits of being able to produce significant 
amounts of expression product in a functional 
conformation include specific oxidation, rather than 

35 random oxidation, so as to produce cross -linked complexes 
including cross- linked elastin or cross -linked collagen 
which are structurally and functionally similar to normal 
physiological complexes. Secondly, the expression 
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product which exists in a functional or native 
conformation is unlikely to cause an adverse reaction in 
vivo, whether the expression product is directly 
administered in vivo or in the situation where residual 
5 expression product remains in a cross -linked complex 

after catalysis, wherein that complex is intended for in 
vivo use. 

The invention provides for high level expression of 
vertebrate nucleotide sequences encoding lysyl oxidase 

10 and lysyl oxidase- like proteins by generating synthetic 
polynucleotides which encode lysyl oxidase or lysyl 
oxidase- like proteins. Synthetic polynucleotides are 
nucleotide sequences which have been specifically 
designed for high level expression of lysyl oxidase or 

15 lysyl oxidase -like proteins in particular host cells. 

Synthetic polynucleotides of the invention are 
generated by introducing specific nucleotide mutations at 
specific positions with respect to a particular codon, so 
as to replace codons which are rarely used, with codons 

20 which are frequently used by the particular host. 

Typically the mutation will not encode an amino acid 
mutation. To this extent, the nucleotide mutations 
within the synthetic polynucleotide are silent mutations. 
The number of codons corresponding to each amino acid 

25 residue is shown in Table 2. As a result of the 

degeneracy of the genetic code it is recognised that for 
some amino acid residues, silent mutations may be 
introduced at any one of nucleotide positions 1, 2 or 3 
of a particular codon. It is further recognised that for 

30 some amino acid residues, for example cysteine, arginine, 
glutamine, few silent mutations may be introduced. In 
particular methionine and tryptophan residues are both 
generally encoded by a single codon. Nonetheless, the 
present inventor anticipates that, as a result of the 

35 known low stringency codon- anti- codon interaction at 
nucleotide position 3 which gives rise to the "wobble 
effect", (Crick, F. et al. 1966) further silent mutations 
may be introduced at position 3, so as to increase the 
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number of codons for a particular amino acid residue 



which may be selected by a particular host cell. 



The Genetic Code 


First 
Posi tion 
(5 1 end) 


Second Position 


Position 
(3< end) 




U 


C 


A 


G 




U 


PHE 
PHE 
LEU 
LEU 


SER 
SER 
SER 
SER 


TYR 
TYR 
Stop 
Stop 


CYS 
CYS 
Stop 
TRP 


U 
C 
A 
G 


C 


LEU 
LEU 
LEU 
LEU 


PRO 
PRO 
PRO 
PRO 


HIS 
HIS 
GLN 
GLN 


ARG 
ARG 
ARG 
ARG 


u 

C 
A 
G 


A 


ILE 
ILE 
ILE 
MET 


THR 
THR 
THR 
THR 


ASN 
ASN 
LYS 
LYB 


SER 
SER 
ARG 
ARG 


U 
C 
A 
G 


G 


VAL 
VAX, 
VAL 
VAL 


ALA 
ALA 
ALA 
ALA 


ASP 
ASP 
GLU 
GLU 


GLY 
GLY 
GLY 
GLY 


U 
C 
A 
G 


Note: Given the position of the bases in a codon, it is 

possible to find the corresponding amino acid. For 
example, the codon (5')AUG(3') on mRNA specifies 
methionine, whereas CAU specifies histidine. UAA, UAG 
and UGA are termination signals. AUG is part of the 
initiation signal, and it codes for internal 
methionines as well . 



The present invention also relates to synthetic 
polynucleotides encoding lysyl oxidase or lysyl oxidase- 

2 0 like proteins wherein the nucleotide sequence of the 

synthetic polynucleotide comprises mutations which encode 
amino acid mutations. The lysyl oxidase and lysyl 
oxidase- like proteins which contain amino acid sequence 
mutations are variants of lysyl oxidase or lysyl oxidase- 

25 like proteins and are also described herein as variants 
of the invention. As described below, variants of the 
invention retain the functional properties of lysyl 
oxidase and lysyl oxidase- like proteins, namely that they 
catalyse oxidation of amine groups including lysine. The 
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amino acid sequence of lysyl oxidase or lysyl oxidase - 
like proteins may be mutated by addition, deletion, 
substitution or combinations thereof of the encoding 
nucleotide sequence and such altered molecules are 

5 variants of the invention provided they retain the 
required activity. In particular, the inventor has 
recognised that the amino acid sequence of lysyl oxidase 
or lysyl oxidase -like proteins may be altered without 
significantly changing the specificity or efficacy of the 

0 enzyme, so that the enzyme containing the amino acid 
mutation is substantially the same as the enzyme 
comprising the native amino acid sequence. The inventor 
also recognises that the specific activity and substrate 
specificity may be altered by mutation of lysyl oxidase 

5 or lysyl oxidase- like protein amino acid sequence. 

Thus the present invention further provides amino 
acid sequence variants of lysyl oxidase and lysyl 
oxidase- like proteins. Variants of the invention retain 
the functional properties of lysyl oxidase and lysyl 

0 oxidase- like proteins, namely that they catalyse 
oxidation of amine groups including lysine. Variants of 
the invention have amino acid sequences which are 
homologous with the amino acid sequence of lysyl oxidase 
or lysyl oxidase- like proteins. For the purposes of this 

5 description, "homology" between the amino acid sequence 
of a lysyl oxidase or lysyl oxidase- like protein and that 
of a variant connotes a likeness short of identity, 
indicative of a derivation of one sequence from the 
other. In particular, an amino acid sequence is 

) homologous to that of a lysyl oxidase or lysyl oxidase- 
like protein if the alignment of that amino acid sequence 
with the sequence of a lysyl oxidase or a lysyl oxidase - 
like protein reveals a similarity of about 65% over any 
20 amino acid stretch or over any repetitive element of 

1 the molecules shorter than 20 amino acids in length. 
Such a sequence comparison can be performed via known 
algorithims such as the one described by Lipman and 
Pearson (1985), which are readily implemented by 
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computer. Similarity is observed between amino acids 
where those amino acids have a side chain which confers a 
similar chemical property in the same chemical 
environment. For example, threonine and serine are 
5 similar amino acids; aspartic acid and glutamic acid are 
similar amino acids; valine, leucine and isoleucine are 
similar amino acids etc. Thus an amino acid sequence may 
be considered homologous with the amino acid sequence of 
a lysyl oxidase or a lysyl oxidase- like protein because 

10 the alignment of those sequences reveals a similarity of 
65%, although at each amino acid position in the aligned 
sequences, none of the residues are identical. 

According to a first aspect of the present 
invention, there is provided a synthetic polynucleotide 

15 encoding an amino acid sequence selected from the group 

consisting of lysyl oxidases, lysyl oxidase -like proteins 
and variants thereof. 

The lysyl oxidase may be a vertebrate including 
human, bovine, porcine, rat or mouse lysyl oxidase, or 

20 avian lysyl oxidase including chick lysyl oxidase. 

Preferably, the lysyl oxidase is a human lysyl oxidase. 
The nucleotide sequences of rat and mouse lysyl oxidase 
are reported in Trackman et al . (1991), porcine in 
Cronshaw et al . (1993), mouse in Contente et al . (1993), 

25 and chick in Wu et al. (1992) . 

The lysyl oxidase- like proteins of the present 
invention are proteins which exhibit an amine oxidation 
catalysing function. 

The variants of lysyl oxidases and lysyl oxidase - 

3 0 like proteins in accordance with the present invention 
may be truncated forms of the lysyl oxidase or lysyl 
oxidase- like proteins wherein the truncated forms retain 
the functional properties of lysyl oxidase and lysyl 
oxidase- like proteins, in particular the ability to 

35 catalyse the oxidation of amine groups, including lysine. 
In toto, the truncated forms are typically greater than 
20 amino acids in length. 
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The variants of the invention also include molecules 
that comprise a particular sequence of amino acids or a 
particular domain of a lysyl oxidase or lysyl 
oxidase -like protein wherein the sequence of amino acids 
5 or domain retain the functional properties of lysyl 

oxidase and lysyl oxidase -like proteins, in particular 
the ability to catalyse the oxidation of amine groups, 
including lysine. 

Synthetic polynucleotides according to the invention 
10 are generated by mutating the native nucleotide sequence 
encoding a lysyl oxidase or lysyl oxidase- like protein or 
variant so that: 

a) all or some of the codons which hamper 
expression in the expression system in which the 

15 polynucleotide is to be expressed, are replaced 

with codons more favourable for expression in 
the expression system; and 

b) the altered sequence encodes a protein or 
polypeptide selected from the group consisting 

20 of lysyl oxidases, lysyl oxidase-like proteins 

and variants thereof. 
In one preferred embodiment the synthetic 
polynucleotide comprises a nucleotide sequence which 
encodes lysyl oxidase wherein the nucleotide sequence has 

25 a CAT score of at least 0.3 in a particular host cell. 

More preferably the synthetic polynucleotide comprises a 
nucleotide sequence which encodes human lysyl oxidase, 
wherein the synthetic polynucleotide has a CAI of at 
least 0.3 in E. coli. Most preferably the synthetic 

30 polynucleotide comprises all or part of the nucleotide 
sequence shown in Figure 2 (SEQ ID NO:3>, starting at 
residue 141 (Met) or 144 (Asp) . In the sequence shown in 
Figure 2 (SEQ ID NO :3) the N- terminal encoded amino acid 
is serine. The presence of the serine residue is the 

35 consequence of the insertion of a BamHI restriction site 
at this position. It will be readily recognised that 
this serine residue could be deleted or replaced by an 
alternative residue related to the restriction site 
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present at the N- terminal end of the construct. 
Similarly, at the 3' end of the construct, additional 
stop codons are inserted compared to the parent sequence 
shown in Figure 1 (SEQ ID NO: 1) . These stop codons are 
5 a feature of this particular construct. 

With respect to the sequence of the lysyl oxidase 
molecule, the first 21 amino acids of the sequence shown 
in Figure 1 (SEQ ID NO: 2) are amino acids of the pre- 
sequence. The residues from this position and up to and 
10 including residue 143 comprise the pro- sequence. Thus 

the mature form of the lysyl oxidase commences at residue 
144 . These different constructs are encompassed within 
the synthetic polynucleotides and encoded products of the 
invention. 

15 In another preferred embodiment, at least 50% of 

codons in a particular nucleotide sequence which may be 
altered so as to reflect frequent codon usage in a 
particular host, are selected and altered. More 
preferably at least 60% are altered. Most preferably at 

20 least 70% are altered. 

In another preferred embodiment at least 50% of 
codons which encode a particular amino acid are altered, 
so as to reflect frequent codon usage in a particualar 
host. More preferably at least 60% are altered. Most 

25 preferably at least 70% are altered. 

Modifications to codon usage to provide enhanced 
expression for a variety of hosts are discussed in: 
Zhang et al . (1991), for E. coli, yeast, fruit fly and 
primates where codon usage tables are provided; 

30 Newgard et al . (1986), for mammals; and 

Murray et al . (1989), for plants. Preferred codon usages 
are indicated in these publications. 

It is important to recognise that even where human 
originating nucleotide sequences are to be expressed in 

35 human cell lines, or other mammalian cell lines, that 

modifications to codon usage can still be beneficial to 
producing enhanced levels of the protein of interest. 
The same applies to expression of other lysyl oxidase and 
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lysyl oxidase like proteins in cell lines derived from 
their originating species or other mammalian or avian 
species . 

In another preferred embodiment, all or part of the 
5 5 1 or 3 ' untranslated region, or intronic regions of the 
synthetic nucleotide sequence encoding the lysyl oxidase 
or lysyl oxidase -like protein or variant thereof, are 
deleted . 

In another preferred embodiment, all or part of the 

10 regions encoding the signal peptide, or regions encoding 
the pro-peptide, of the synthetic nucleotide sequence 
encoding the lysyl oxidase or lysyl oxidase- like protein 
or variant thereof, are deleted. 

More preferably, all or part, of the 5' or 3 1 

15 untranslated region, and/or intronic regions, and regions 
encoding the signal peptide, and/or regions encoding the 
pro-peptide, of the synthetic polynucleotide encoding the 
lysyl oxidase or lysyl oxidase- like protein or variant 
thereof are deleted. 

20 It will be recognised that the strategies for 

specific synthetic gene construction to yield enhanced 
expression of lysyl oxidases described in detail herein, 
can readily be adapted to the synthesis of other lysyl 
oxidases, lysyl oxidase -like proteins and variants 

25 thereof not specifically exemplified here, in light of 
the significant degree of homology between these 
molecules and in light of the fact that the present 
inventor has established that these proteins can be 
expressed in functional form in significant amounts from 

30 particular host cells using synthetic gene constructs 
with enhanced codon usage. 

In another preferred embodiment the synthetic 
polynucleotide comprises a nucleotide sequence encoding 
lysyl oxidase, lysyl oxidase -like protein or a variant 

35 thereof and a further nucleotide sequence, wherein the 
nucleotide sequence encoding lysyl oxidase, lysyl 
oxidase- like protein or a variant thereof and the further 
nucleotide sequence are chemically linked so as to encode 
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a single open reading frame. The further nucleotide 
sequence is preferably chemically linked to the 5' end of 
the nucleotide sequence encoding lysyl oxidase, lysyl 
oxidase- like protein or a variant thereof. 
5 in another preferred embodiment, the further 

nucleotide sequence encodes all of part of a polypeptide 
which causes the lysyl oxidase, lysyl oxidase- like 
protein or variant thereof to be either secreted to the 
extracellular environment, expressed as a cell surface 

10 periplasmic protein, or expressed in the intracellular 

environment. Preferably the further nucleotide sequence 
encodes all or part of glutathione S- transferase. 

In another preferred embodiment the further 
nucleotide sequence encodes additional residues such as 

15 an N- terminal methionine or formyl -methionine . 

In a preferred form of the invention the synthetic 
polynucleotide is unfused. 

In another preferred embodiment, the synthetic 
polynucleotide comprises a fragment of the nucleotide 

20 sequence shown in Figure 2, wherein the fragment of the 
nucleotide sequence encodes a protein which retains the 
enzymatic activity of lysyl oxidase, lysyl oxidase -like 
protein or a variant thereof. Preferably the synthetic 
polynucleotide comprises the sequence shown in Figure 2 

25 and commencing at the codon encoding residue 141. In 

another preferred embodiment the synthetic polynucleotide 
comprises the sequence shown in Figure 2 and commences at 
or near the codon encoding residue 144 . 

In a further preferred embodiment, the synthetic 

30 polynucleotide comprises the nucleotide sequence shown in 
Figure 2 (SEQ ID NO: 3), or a fragment thereof, in which 
only some of the nucleotide mutations have been 
introduced. Preferably, the synthetic polynucleotide 
comprises the nucleotide sequence shown in Figure 2 (SEQ 

35 ID NO:3), or a fragment thereof, and at least 50% of the 
nucleotide mutations have been introduced. More 
preferably, the synthetic polynucleotide comprises the 
nucleotide sequence shown in Figure 2 (SEQ ID NO: 3) in 
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which the number of nucleotide mutations which have been 
introduced produce a CAI score for the synthetic 
polynucleotide in E. coli of at least 0.30. 

The production of the synthetic polynucleotides of 
5 the invention, which is described in detail in the 
accompanying Examples, can be via the techniques of 
organic synthesis or recombinant DNA technology, or a 
combination of both techniques. 

The synthetic polynucleotides described in the 
10 accompanying Examples were designed to make them amenable 
to cloning and expression in fused and unfused forms, and 
for intracellular and secreted production. 

Where the synthetic polynucleotide is prepared from 
assembled oligonucleotides it is preferred to incorporate 
15 restriction sites in the sequence to facilitate assembly 
of the polynucleotide. 

Restriction sites incorporated in the polynucleotide 
sequence are also useful for: 

1 . facilitating subcloning of manageable blocks for 
20 sequence confirmation; 

2. providing sites for later introduction of 
modifications to the polynucleotide as insertions, 
deletions or base changes; 

3 . facilitating confirmation of correct poly- 
25 nucleotide assembly by restriction endonuclease 

digestion. 

The synthetic polynucleotides encoding the 
variants of the invention can be produced by the 
techniques of site-directed mutagenesis or random 
30 mutagenesis. These techniques allow one to determine the 
effect of mutation at particular positions in the 
synthetic polynucleotide sequence on the amine oxidation 
catalysing properties of the encoded variant of the 
invention. 

3 5 In one preferred embodiment, the variants of the 

invention are produced by the technique of site-directed 
mutagenesis using oligonucleotides, which comprises the 
following steps: 
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1. synthesis of an oligonucleotide with a sequence 
that contains the desired nucleotide mutation; 

2. hybridizing the oligonucleotide to a template 
nucleotide sequence encoding lysyl oxidase or a lysyl 

5 oxidase- like protein; and 

3. using a DNA polymerase to extend from the 3' end 
of the oligonucleotide in the 5 1 to 3' direction. 

Another approach which is particularly suited to 
situations where the synthetic polynucleotide has been 

10 prepared from oligonucleotide blocks bounded by 

restriction sites is cassette mutagenesis where entire 
restriction fragments are inserted, deleted or replaced. 

According to a second aspect of the present 
invention there is provided a recombinant nucleic acid 

15 molecule comprising a synthetic polynucleotide of the 
first aspect. Preferably the recombinant nucleic acid 
molecule comprises vector nucleic acid. 

The vectors of the invention include plasmids, 
phages, and phagemids. The vectors of the invention can 

20 be used in integrative expression systems or lytic or 
comparable expression systems. 

Suitable vectors will generally contain origins of 
replication and control sequences which are derived from 
species compatible with the intended expression host. 

25 Typically these vectors include a promoter located 

upstream from the synthetic polynucleotide, together with 
a ribosome binding site if intended for prokaryotic 
expression, and a phenotypic selection gene such as one 
conferring antibiotic resistance or supplying an 

30 auxotrophic requirement. For production vectors, vectors 
which provide for enhanced stability through partitioning 
may be chosen. Where integrative vectors are used it is 
not necessary for the vector to have an origin of 
replication. Lytic and other comparable expression 

3 5 systems do not need to have those functions required for 
maintenance of vectors in hosts . 

For E. coli typical vectors include pBR322, 
pBluescript II SK + , pGEX-2T, pTrc99A, pET series vectors, 
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particularly pET3d, (Studier et al . 1990) and derivatives 
of these vectors. 

In one preferred embodiment, the vector comprises a 
synthetic polynucleotide of the first aspect, wherein the 
5 synthetic polynucleotide has a CAI score of at least 0.3 0 
in a particular host cell. Preferably the vector 
comprises all or part of a synthetic polynucleotide 
sequence shown in Figure 2 (SEQ ID NO: 3) . 

In another preferred embodiment, the vector 

10 comprises a synthetic polynucleotide of the first aspect 
wherein at least 50% of codons in a particular nucleotide 
sequence which may be altered so as to reflect frequent 
codon usage in a particular host, are selected and 
altered. More preferably at least 60% are altered. Most 

15 preferably at least 70% are altered. 

In another preferred embodiment the vector comprises 
a synthetic polynucleotide of the first aspect wherein at 
least 50% of codons which encode a particular amino acid 
are altered, so as to reflect frequent codon usage in a 

20 particular host. More preferably at least 60% are 
altered. Most preferably at least 70% are altered. 

In another preferred embodiment the vector comprises 
a synthetic polynucleotide of the first aspect wherein 
all or part of the 5' or 3' untranslated region, and/or 

25 intronic regions is deleted. 

In another preferred embodiment the vector comprises 
a synthetic polynucleotide of the first aspect wherein 
all or part of the regions encoding the signal peptide, 
and/or regions encoding the pro-peptide is deleted. 

30 In another preferred embodiment the vector comprises 

a synthetic polynucleotide of the first aspect wherein 
all or part, of the 5' or 3' untranslated region, and/or 
intronic regions, and/or all or part of the regions 
encoding the signal peptide, and or regions encoding the 

35 pro-peptide is deleted. 

In another preferred embodiment, the recombinant DNA 
molecule comprises a synthetic polynucleotide of the 
first aspect, wherein the synthetic polynucleotide 
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sequence is chemically linked to a further nucleotide 
sequence so as to encode a single open reading frame for 
the synthetic polynucleotide and the further nucleotide 
sequence. Preferably, the recombinant DNA molecule 
5 comprises a synthetic polynucleotide and the further 
nucleotide sequence wherein the further nucleotide 
sequence is chemically linked to the 5 1 end of the 
synthetic polynucleotide. 

In another preferred embodiment, the recombinant DNA 

10 molecule comprises a synthetic polynucleotide, wherein 

the synthetic polynucleotide comprises all or part of the 
nucleotide sequence shown in Figure 2 (SEQ ID NO: 2) . 
Preferably, the recombinant DNA molecule comprises a 
synthetic polynucleotide sequence having all or part of 

15 the sequence shown in Figure 2 (SEQ ID NO: 3) wherein only 
some of the indicated nucleotide mutations have been 
introduced. 

According to a third aspect of the present invention 
there is provided a host cell transformed with a 
20 recombinant DNA molecule of the second aspect of the 
invention. 

The host cells may be prokaryotic cells including 
bacteria, and may also include yeasts, insect cells and 
other eukaryotic cells or whole organisms. 
25 In a preferred embodiment, the host is an E. coll 

strain including E. coli B strain derivatives (Studier et 
al. 1990), and E. coll K12 derivatives such as NM522 
(Gough and Murray, 1983) and XLl-Blue (Bullock et al. 
1987) . 

3 0 In another preferred embodiment, the host is S. 

cerivisiae or Pichia. 

According to a fourth aspect of the present 
invention there is provided an expression product, 
wherein the expression product is derived from a host 

35 cell of the third aspect, the expression product 

comprising an amino acid sequence selected from the group 
consisting of lysyl oxidase, lysyl oxidase -like proteins 
and variants thereof. 
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In one preferred embodiment, the expression product 
comprises the amino acid sequence shown in Figure 2 (SEQ 
ID NO: 4) and is encoded by a synthetic polynucleotide 
which comprises the nucleotide sequence shown in (SEQ ID 
5 NO: 3) . 

In an alternative embodiment, the expression product 
comprises a fragment of the amino acid sequence shown in 
Figure 2 (SEQ ID NO: 4) which retains amine oxidation 
catalysing properties, and is encoded by a synthetic 

10 polynucleotide comprising the corresponding nucleotide 

sequence shown in Figure 2 (SEQ ID NO: 3) . Preferably the 
expression product comprises an amino acid sequence 
having an N- terminus at amino acid residue position 141 
(Met) of Figure 2 (SEQ ID NO: 4) or results in an N- 

15 terminal amino acid residue position 144 (Asp) of Figure 
2, (SEQ ID NO:4) and is encoded by a synthetic 
polynucleotide comprising the corresponding nucleotide 
sequence shown in Figure 2 (SEQ ID NO: 3) . 

In another preferred embodiment, the expression 

20 product is encoded by a synthetic polynucleotide 

comprising the nucleotide sequence shown in Figure 2 (SEQ 
ID NO : 3) in which only some of the nucleotide mutations 
have been introduced. Preferably the expression product 
is encoded by a synthetic polynucleotide comprising all 

25 or part of the nucleotide sequence shown in Figure 2 (SEQ 
ID NO: 3) and at least 50% of the nucleotide mutations 
relative to Figure 1 have been introduced. More 
preferably the expression product is encoded by a 
synthetic polynucleotide Comprising all or part of the 

30 nucleotide sequence shown in Figure 2 (SEQ ID NO: 3) in 

which the number of nucleotide mutations which have been 
introduced produce a CAI score for the synthetic 
polynucleotide in E. coli of at least 0.30. 

The expression products of the fourth aspect may be 

3 5 fused expression products which include all or part of a 
protein encoded by the vector in peptide linkage with the 
expression product. They may also include, for example, 
an N- terminal methionine or other additional residues 
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which do not permanently impair the catalytic properties 
of the product. 

In a further preferred embodiment, the expression 
products of the invention are fusion products which 
5 comprise an amino acid sequence selected from the group 
consisting of lysyl oxidase, lysyl oxidase-like proteins 
and variants thereof which are encoded by a synthetic 
polynucleotide, wherein the amino acid sequence is 
chemically linked to a further amino acid sequence. 

10 Typically, the further amino acid sequence is chemically 
linked to the N- terminus of the expression product. 
Preferably the further amino acid sequence allows 
purification of the fusion product. Most preferably the 
further amino acid sequence comprises an amino acid 

15 sequence encoding glutathione S- transferase. The 

further amino acid sequence may be chosen in order to 
cause the fusion product to be secreted to the 
extracellular evironment, expressed as a cell membrane 
protein or expressed in the intracellular environment. 

20 The expressed fusion products may subsequently be 

treated to remove the further amino acid sequence from 
the amino acid sequence selected from the group 
consisting of lysyl oxidase, lysyl oxidase-like proteins 
and variants thereof. Preferably, the treatment is via 

25 chemical cleavage. In another more preferred embodiment, 
the treatment is via protease digestion. Still more 
preferably, the treatment is effected by endogenous host 
cell secretion machinery, for example yeast cell 
secretion machinery. 

30 Non- fused systems include the introduction of or use 

of a pre-existing methionine codon. An example of this is 
the use of pET3a and pET3d in E. coli. 

In a further embodiment, the invention provides a 
pharmaceutical composition comprising an expression 

35 product of the invention and a pharmaceutically 
acceptable carrier, excipient or diluent. The 
pharmaceutical compositions are prepared and tested 
according to standard pharmaceutical practise. The 
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support for their efficacy is provided by the assays in 
the Examples. 

According to a fifth aspect of the invention there 
is provided a method for producing an expression product 
5 of the invention, the method comprising the following 
steps : 

(i) providing a host cell of the third aspect of the 
invention. 

(ii) maintaining the host cell in conditions 

10 suitable for expression of the synthetic polynucleotide; 
and 

(iii) isolating the expression product. 

In one preferred embodiment the expression product 
is produced by expression of the synthetic polynucleotide 

15 in E. coll. The invention extends to the use of other 

host cells which are capable of producing the expression 
product by expression of the synthetic polynucleotides 
designed for use in E. coli. 

In another preferred embodiment, the expression product 

20 is produced by expression of a synthetic polynucleotide 
of the invention in other microbial expression systems 
including prokaryotic expression systems such as 
bacterial expression systems, and also including insect 
cell expression systems, and expression systems involving 

25 other eukaryotic cells or whole organisms. Again 

synthetic polynucleotides which are designed for use in a 
particular host can also be used in other cells which 
provide high expression of that sequence. 

As disclosed in the Examples, the inventor has 

3 0 determined that it is useful to add Cu to the expression 
product which has been purified from the inclusion bodies 
of E. coli or similar organisms, so that the expression 
product can fold into a functional conformation which has 
the enzymatic activity of lysyl oxidase or lysyl oxidase - 

35 like proteins. The role of Cu in solubilising lysyl 
oxidase protein was not apparent until the present 
invention was carried out. Indeed based on earlier 
reported results which show that inactive apoenzyme can 
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be reactivated by the addition of Cu it would have been 
expected that re- folding could occur without Cu. The 
present inventor initially attempted to refold protein in 
the absence of Cu and failed to generate soluble 
5 protein. It was only in subsequent experiments where Cu 
was added that refolding in this environment succeeded. 

Further, should over expressed lysyl oxidase lack 
copper after its production in bacteria, the 
incorporation of copper into the recombinant protein 

10 would be possible by adapting the previously reported 

protocol (Gacheru et al . 1990) used for conversion of the 
apoenzyme of natural lysyl oxidase into the metal 
containing form. 

In addition, it would be desirable to add oxidising 

15 substances that assist in the production of the organic 
cofactor if this is not formed by the enzyme under the 
prevailing conditions without assistance. The most 
important additional substance is oxygen in solution. 
It is proposed here that generation of the active 

20 enzyme in vitro is a time -dependent process involving the 
introduction of copper to the overexpressed polypeptide, 
followed by oxidation of the protein to generate the 
covalently bound quinone. This is despite the absence of 
a Asn-Tyr-Asp/Glu consensus sequence from lysyl oxidase 

25 presumed to be important to this type of process 
(Tanizawa 1995) . 

The expression product may be exposed to chemical 
agents to enhance the stability and activity of the 
molecule. The chemical agents include various 

30 concentrations of urea or glycerol combined with storage 
at -20°C. 

In one preferred form the expression product is 
produced in the form of inclusion bodies which are 
harvested from the transformed host . 
35 As described above, the inventor recognised a 

variety of uses for lysyl oxidase, lysyl oxidase- like 
proteins and variants thereof although these uses 
require significant amounts of protein in functional 
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conformation. As the inventor has for the first time 
produce significant amounts of enzyme in functional 
conformation, the following uses, as described below can 
now be realised. 
5 The invention provides a method of cross -linking 

molecules that contain primary amines, wherein the 
molecules are associated with a cell membrane or 
extracellular environment, comprising contacting the 
primary amine with an expression product of the 

10 invention. 

As described in the Examples, the inventor now 
recognises that the expression products of the invention 
have specific activity for primary amine groups in 
peptide and non- peptide substrates. Furthermore, as 

15 described in the Examples, as the present inventor has, 
for the first time, observed cross- linking of both 
peptide and non-peptide substrates in vitro, using the 
expression product of the invention, the inventor 
envisages that the expression products of the invention 

20 will be useful for cross- linking a wide range of 

molecules such as peptides and polypeptides, non-peptide 
chemical polymers which contain primary amine groups and 
combinations thereof. In particular, the present 
inventor envisages that the expression product of the 

25 invention will be widely applicable to methods of 

treatment in vivo, in particular where those methods of 
treatment require cross- linking of elastin or collagen. 
For this purpose the expression product can be formulated 
in suitable carriers, buffers and other conventional 

30 delivery systems. 

In one preferred embodiment, the invention provides 
a method of cross -linking molecules that contain primary 
amines, wherein the molecule is a peptide or polypeptide 
molecule and the primary amine is associated with an 

35 amino acid side chain. The method comprises contacting 
the peptide or polypeptide molecule with an expression 
product of the invention under conditions suitable to 
cross -link the molecules. Preferably, the peptide or 
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polypeptide comprises the amino acid sequence of 
tropoelastin or a fragment thereof. In another preferred 
embodiment, the peptide or polypeptide comprises the 
amino acid sequence of collagen or a fragment thereof. 
5 It can also comprise both. 

In another preferred embodiment the invention 
provides a method of cross- linking molecules that contain 
primary amines, wherein the the molecule is a non-peptide 
molecule. The method comprises contacting the non- 
10 peptide molecule with an expression product of the 

invention under conditions suitable to cross -link the 
molecules . 

The present inventor also recognises that the 
expression products of the invention would find wide 

15 application in accelerating the rate of wound healing. In 
particular the inventor has recognised that lysyl 
oxidase, lysyl oxidase- like proteins and variants thereof 
may be used to accelerate wound healing. Wound healing 
may be accelerated by administering an expression product 

20 of the invention to the wound. Alternatively, an 

expression product of the invention may be included in a 
matrix including an elastin, tropoelastin and/or collagen 
based or amine containing matrix which is itself applied 
to the wound. In both methods, wound healing is 

25 accelerated by cross- linking of molecules which contain 
primary amines, and the cross- linking is mediated by the 
expression product of the invention. For this purpose 
the expression product can be formulated in suitable 
carriers, buffers and other conventional delivery 

30 systems. 

Thus the invention provides a method of accelerating 
the rate of wound healing, the method comprising the step 
of administering an expression product of the invention 
to a wound under conditions which are suitable for cross - 
35 linking of molecules containing primary amines. 

Preferably, an expression product of the invention is 
administered to the wound by direct application of the 
product to the wound. More preferably the product is 
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administered to the wound by first including the product 
in a matrix which is itself administered to the wound. 
Most preferably, the product is administered to the wound 
by application of a pharmaceutical formulation comprising 
5 the expression product of the invention and a 
pharmaceutically acceptable carrier, diluent or 
excipient . 

In another preferred embodiment, the invention 
relates to a method of deccelerating the rate of wound 

10 healing. The inventor recognises that there are 

particular surgical procedures which require a delay in 
wound healing so as to achieve a desirable outcome. A 
lysyl oxidase -mediated delay in wound healing is based on 
the ability to inhibit the oxidation of primary amines 

15 and therefore the ability to inhibit the cross- linking of 
particular molecules, where that cross -linking is 
essential to wound healing. Lysyl oxidase inhibitors are 
known, for example BAPN, although these reagents are 
typically noxious and therefor unsuitable for use in 

20 methods of treatment. The inventor envisages that a 
delay in wound healing which is mediated by the 
inhibition of oxidation of primary amines can be achieved 
by developing novel reagents which have specific anti- 
lysyl oxidase acitvities and which have no side-effects. 

25 Particularly suitable inhibitors of lysyl oxidase 

activities are anti- lysyl oxidase antibodies. These 
immunological reagents may be generated by raising 
antibodies to the expression products of the invention. 
The methods for the production of monoclonal antibodies 

3 0 or polyclonal antibodies are within the knowledge of the 
skilled addressee and are described in detail in standard 
immunological texts. Furthermore, the skilled addressee 
can readily determine those antibodies with anti -lysyl 
oxidase activities, in particular the ability to inhibit 

35 the oxidation of primary amines by performing screening 
of those antibodies according to the benzylamine 
substrate assay, tropoelastin substrate assay or 
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tritiated tropoelastin substrate assay, as described in 
the accompanying Examples . 

Thus, the invention provides a method of 
deccelerating the rate of wound healing, the method 
5 comprising the step of administering antibodies which are 
raised against the expression product of the invention, 
to a wound under conditions which allow the antibodies to 
substantially inhibit the oxidation of primary amines. 

In another aspect embodiment, the invention relates 

10 to novel reagents for modifying the rate of wound 

healing. In particular, the inventor recognises that the 
expression products of the invention can be used to link 
chemical species to heterologous surfaces to 
substantially promote or substantially inhibit the rate 

15 of wound healing. The chemical species of interest 

generally include those species which contain a primary 
amine and which can be linked by oxidation of the primary 
amine to a heterologous surface, but more specifically 
comprise those chemical species selected from the group 

20 consisting of immunomodulatory substances including 

growth factors and cytokines, and substances which modify 
inflammatory responses or the coagulation cascade and 
also antibiotic substances. The heterologous surface may 
be a matrix comprising a complex of biomolecules , in 

25 particular elastin or tropoelastin, or a complex of non- 
peptide molecules. The expression product of the 
invention may be used to form a linkage between two 
chemical species which contain primary amine groups. 
Thus the invention has particular application to the 

30 attachment of an active ingredient to a carrier molecule. 
Thus the invention provides novel reagents and 
formulations thereof for use in modifying the rate of 
wound healing, wherein the reagents are generated by 
oxidising a primary amine contained in a chemical species 

35 so as to link the chemical species to a heterologous 
surface . 

In another preferred embodiment, the invention 
relates to the use of an expression product of the 
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invention in the treatment or prevention of abnormal 
fibrosis. Lysyl oxidase has an important functional role 
in fibrosis and is also believed to be involved in the 
abnormal process of fibrosis, in particular in the 
5 fibrotic liver (Kagan H. M. 1994) . The present inventor 
envisages the use of the expression products of the 
invention in promoting fibrosis so as to achieve a normal 
physiological outcome and inhibiting fibrosis where an 
abnormal outcome is to be avoided. Fibrosis may be 

10 promoted by increasing the levels of available lysyl 
oxidase or by increasing the specific activity of the 
enzyme. For example fibrosis may be promoted by 
administering an expression product of the invention to a 
particular site so as to promote fibrosis at that site. 

15 Fibrosis may also be promoted by administering anti- lysyl 
oxidase antibodies which augment or enhance the specific 
activity of the enzyme, for example by interfering with 
aggregation of the protein. Fibrosis may be inhibited by 
inhibiting or reducing the ability of the lysyl oxidase 

20 to oxidise primary amine residues. The inventor 

envisages that fibrosis, as mediated through the role of 
lysyl oxidase in the oxidation of primary amines, could 
be inhibited by the use of antibodies which inhibit the 
oxidation reaction. As described above, the methods for 

25 the production of monoclonal antibodies or polyclonal 
antibodies are within the knowledge of the skilled 
addressee. Furthermore, the skilled addressee can 
readily determine those antibodies with anti -lysyl 
oxidase activities which either inhibit or enhance lysyl 

30 oxidase activity, in particular the ability to inhibit or 
enhance the oxidation of primary amines by performing 
screening assays according to the benzylamine substrate 
assay, tropoelastin substrate assay or tritiated 
tropoelastin substrate assay, as described in the 

35 accompanying Examples. 

As described above, three separate assays are 
disclosed in the accompanying Examples for measuring 
levels of lysyl oxidase activity. The inventor 



SUBSTITUTE SHEET (RULE 26) 



WO 98/06830 



- 30 



PCT/AU97/00505 



recognises that the expression products of the invention 
could be used in a particular method so as to screen 
agents for the ability to inhibit or enhance lysyl 
oxidase acitivity. Agents which are identified as having 
5 inhibitory or enhancing effects on the specific activity 
of lysyl oxidase would be desirable for use in specific 
treatments, including wound healing or fibrosis, as 
described above. 

Thus in a preferred embodiment, the invention 

10 provides a method for screening agents which 

substantially enhance or substantially inhibit the 
specific activity of lysyl oxidase or lysyl oxidase -like 
proteins, the method comprising the step of contacting 
the agent with an expression product of the invention and 

15 measuring the specific activity of the expression 
product . 

To the extent that lysyl oxidase and lysyl oxidase - 
like proteins are involved in the oxidation of primary 
amines, the inventor recognises that the expression 

20 products of the invention are useful in assays for 

detecting substrates which contain primary amines. The 
expression products of the invention are also useful to 
quantify amounts of protein or non-protein substrates 
wherein those substrates contain a known number of 

25 primary amine moieties. The expression products of the 

invention are particularly suited to such an assay as the 
expression product is derived from a pure source and will 
therefore specifically detect primary amine groups. 
Furthermore, as the expression products of the invention 

30 may be variants of lysyl oxidase or a lysyl oxidase- like 
protein, the inventor recognises that molecules 
associated with a particular primary amine (for example a 
primary amine group displayed in a specific manner) may 
be specifically detected over other molecules in the 

3 5 sample which contain a primary amine group. The assay 
could be performed according to any of the three 
substrate assays detailed in the accompanying Examples . 
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Thus in a preferred embodiment, the invention 
relates to a method for detecting a protein or non- 
protein substrate in a sample, the method comprising 
contacting the sample with an expression product of the 
5 invention and detecting oxidation of any primary amine 
moieties in the sample. 

In another preferred embodiment, the invention 
relates to a method for quantifying the amount of a 
protein or non-protein substrate in a sample, wherein the 
10 number of primary amine moieties per substrate molecule 
is known, the method comprising contacting the sample 
with an expression product of the invention and detecting 
the amount of oxidation of any primary amine moieties in 
the sample. 

15 in one aspect, the lysyl oxidase, lysyl oxidase -like 

proteins and variants thereof may be used to cross- link a 
primary amine containing molecule such as tropoelastin or 
collagen to the surface, wherein the solid surface has 
primary amine groups which are available for cross - 

20 linking. In this aspect, the invention provides the use 
of lysyl oxidase, lysyl oxidase -like proteins and 
variants thereof as an agents for forming coatings on 
solid surfaces. The present inventor envisages that 
those surfaces which have primary amine groups available 

25 for cross -linking will be useful for the manufacture of 
synthetic tissues for example, blood vessels. 

In one embodiment, it is envisaged that surface or 
devices could be dipped into solutions of tropoelastin 
and/or collagen in the presence of a lysyl oxidase, lysyl 

3 0 oxidase -like protein or variant of the invention. 

In another embodiment, a cross -linked material could 
be preformed and then cross- linked to available primary 
amine groups on a surface, using a lysyl oxidase, lysyl 
oxidase- like protein or variant of the invention. 

3 5 The cross -linked molecule may comprise collagen 

and/or tropoelastin and/ or other cross -linkable primary 
amine containing molecules . 
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In another aspect, the invention provides a tissue 
glue, which is of use in assisting wound healing. This 
glue can be used as a replacement for sutures. The glue 
comprises an expression product of the invention and may 
5 additionally comprise at least one cross- linkable primary 
amine material, for example, tropoelastin and/or 
collagen. The glue may also be used to repair incisions 
in for instance, surgical applications and other body 
tissue defects. The glue may be applied by mixing the 
10 primary amine containing material and expression product 
and applying to the site. 

Other uses for the expression products of the 
invention include the preparation of sheets or films 
where the product can be used as a reinforcement as well 
15 as for its biological properties. Applications could 

include repair of anastomoses, repair of defects in the 
heart. The expression products can also be used to 
prepare solid structures such as beads which may find 
application in tissue bulking applications such as in the 
20 urethra, oesaphagus or in dermatological applications 

where primary amine containing substances are desirably 
deposited in cross -linked form. 

The expression products of the invention are also 
useful as agents for: 
25 - rrg The routes envisaged for the administration of 
this form of cancer therapy include gene and protein 
therapies in which the gene or protein is 
administered in accordance with protocols already 
developed with respect to other molecules proposed 
30 for use in gene or protein therapy. In one 

application category it is proposed that 
extracellular application of lysyl oxidase either 
alone or as part of a delivered complex, could 
reduce the growth and behaviour of cancer and other 
35 cells, including but not limited to ras- transformed 

cells. Lysyl oxidase administration has the 
potential to display the anti- cancer phenotypic 
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potential of drugs such as azatyrosine (Krzyzosiak 
et al., 1992; Contente et al . , 1990). 
Regulating cell growth. Increased lysyl oxidase may 
be considered to mimic some of the functions 
5 mediated by transforming- growth- factor-beta 1 

(Shibanuma et al., 1993). 

Glue to join primary amine materials e.g. in 
surgical or veterinary applications. The expression 
product can be formulated in suitable carriers, buffers 
10 and other conventional delivery systems. This includes 
natural materials, synthetic materials and composites. 

Oxidising accessible lysines (and other amines) to 

modify behaviour of those molecular components. 

Modified lysyl oxidase to perform further reactions. 

15 BRIEF DESCRIPTION OF ACCOMPANYING DRAWINGS 

Figure 1: Entire coding sequence (SEQ ID N0:1) for 
an amino acid sequence (SEQ ID NO: 2) of a secreted form 
of human lysyl oxidase. A potential signal sequence (1 
to 21 aa) is underlined (Mariani et al., 1992; Hamainen 

20 et al., 1991; Svinarich et al . 1992). 

Figure 2: DNA sequence (SEQ ID NO: 3) of a 
synthetic polynucleotide SHLOX encoding the amino acid 
sequence (SEQ ID NO: 4) of a lysyl oxidase. Restriction 
sites at the boundaries of each block are identified; 

25 Figure 3 : Comparison of the synthetic sequence SHLOX 

with the natural sequence (GenBank HUMLOX coordinates) . 
HUMLOX is on the upper line and SHLOX on the lower line, 
in each case; 

Figure 4: Sequences of synthetic oligonucleotides 

30 for SHLOX construction (5 1 is at the left of each 

sequence, and 'p' denotes phosphate). (SEQ ID NOS:5-30) 

Figure 5: Measure of lysyl oxidase activity as 
indicated by the production of tritiated water (TCA 
precipitated counts) . 

35 Figure 6: Measure of lysyl oxidase activity as 

indicated by the production of tritiated water. 
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BEST METHOD OF C ARRYING OUT THE INVENTION 

Design of a synthetic lvs yl oxifiasp qpiip 

Human lysyl oxidase is typically made in human cells 
as a longer form, which is proteolytically processed to 
generate a shorter protein secreted into the 
extracellular space. The amino acid sequence has been 
compiled from data from different sources: complete 
sequence data which was available for cDNA encoding human 
lysyl oxidase (Hamalainen et al . , 1991; Mariani et al . , 
1992) , as well as partial human genome sequence 
(Svinarich et al . , 1992). Analytical comparison of the 
cDNA sequences showed a discrepancy in amino acid 
composition which required elucidation before the 
synthetic gene could be made, namely Alal36 (GCT) was in 
conflict with Argl36(CGT) of Hamalainen et al . , (1991). 
The present inventor dealt with this by considering the 
genomic data, and analysing homologous regions of lysyl 
oxidase from different species, including rat, mouse, 
bovine and chicken. The choice made was later confirmed 
(Hamalainen et al., 1993) supporting the correction that 
had been made in generating the synthetic gene. The 
synthetic gene was designed with strategically placed 
restriction sites to facilitate conversion to novel 
derivatives and counterparts of lysyl oxidase and lysyl 
oxidase -related forms (Kim et al . , 1995). The synthetic 
gene was designed with a codon usage pattern favourable 
to high level expression in E. coli (Figure 2) . 

As lysyl oxidase is a secreted protein its cDNA 
derived sequence encodes a region for a signal peptide, 
which would not be useful for generating a functional 
enzyme in E. coli but rather for directing eventual 
transfer of the newly translated protein from inside the 
cell into the extracellular matrix. Furthermore the 
secreted pro- lysyl oxidase was thought to be cleaved by 
extracellular protease (s) to process the mature protein, 
yet it was not known if the pre- cleaved protein was 
active. There were doubts regarding the precise 
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boundaries of the presumably removed signal peptide and 
propeptide ("leading peptide"). The likely leading 
peptide sequence in the N- terminal part of the lysyl 
oxidase sequence was determined by computer analysis and 
5 from the definitions provided by others (Trackman et al., 
1990; Hamalainen et al., 1991) and was excluded from gene 
synthesis. Subsequent to this, further modifications were 
made to adjust for varied N- terminal sequences, various 
candidate signal sequences and eventually a better 

10 estimate of the leading peptide sequence, to generate 
derivatives of SHLOX, as detailed below. 

Initial gene construction and cloning was conducted 
as described by Martin et al . (1995) . The lysyl oxidase 
sequence of Figure 1 was backtranslated according to a 

15 table of average codon frequencies for a group of highly 
expressed E. coli genes (GCG package version 7-UNix using 
CodonFrequency and GenRunData: ecohighcod) . The 
resulting codon sequence lacked rare codons. 
Modifications were made to the refined sequence to allow 

20 for convenient construction, cloning and analysis by 
adding short artificial sequences to each end of the 
codon- optimized gene to provide BamHI- compatible cohesive 
termini, an Ncol site near the 5' end of the gene, and 
three stop codons at the end of the open reading frame. 

25 Restriction sites were introduced into the synthetic 
coding sequence at 270 - 300 base pair intervals, 
dividing the gene into 4 sections of a manageable size 
for individual assembly, cloning and sequence analysis. 
Each section was subdivided into pairs of 

30 complementary oligonucleotides. Internal junctions 
between oligonucleotide pairs were delineated by 
nucleotide overlaps, whilst the external junctions fall 
at the restriction enzyme cleavage points. The codon- 
optimized sequence and restriction sites are shown in 

3 5 Figure 2, together with the encoded amino acid sequence. 
Oligonucleotide sequences and designations are shown in 
Figure 4 . 
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Oligonucleotide pairs 2 to 6 were made by the Sydney 
University Macromolecular Analysis Centre, Australia and 
subsequently purified by denaturing polyacrylamide gel 
electrophoresis (PAGE) as follows. Oligonucleotides 
5 (>85-mers) were run on PAGE containing 7M urea and 8% to 
10% acrylamide (acrylamide :bisacrylamide 19:1) . Samples 
were heated to 75 °C for 3 minutes before loading. 
Tracking dye (0.05% bromphenol blue, 0.05% xylene cyanole 
FF in deionized formamide) was loaded into an adjacent 

10 lane and electrophoresis conducted at constant power of 
25 W (ca. 300 V) until the bromphenol marker was within 
lcm of the base of the gel. Product bands were 
visualized by UV- shadowing over a fluorescent TLC plate. 
Gel fragments containing purified oligonucleotides were 

15 excised and transferred to microcentrifuge tubes, crushed 
and soaked overnight at 60 °C in 500^1 elution buffer 
(0.3M sodium acetate pH7.0). A second extraction was 
performed with 400/il elution buffer, for 3 to 4 hours at 
60°C and the supernatant combined with that of the first 

2 0 extraction. The total volume of the DNA- containing 

solution was reduced to approximately 400^1 by extraction 
with n-butanol and DNA was precipitated with lml 96% 
ethanol . Purified oligonucleotides were collected by 
centrifugation, redissolved in 20/xl TE buffer and 

25 quantified by spectrophotometry. The final yield of 
oligonucleotide was typically 5 to 30 fig. 
Oligonucleotides 1 and 7 to 13 were synthesised and 
purified by Macromolecular Resources, USA. 

Complementary oligonucleotide pairs (l^g each, 

30 approximately 30pmol for 95-mer) were annealed in 10^1 
buffer containing 50mM Tris.HCl pH7.5, lOmM MgCl 2 . The 
mixture was overlaid with 12/xl paraffin oil, heated to 
95°C and cooled to 16°C over I6h in a microprocessor- 
controlled heating block (Perkin Elmer Cetus Thermal 

35 Cycler) . Annealed samples were transferred to clean 

microcentrifuge tubes and a small aliquot (1/xl) withdrawn 
for analysis by agarose gel electrophoresis. 
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For each block comprising three complementary 
oligonucleotide pairs (blocks 1-3) , four separate 
reactions were set up (A+B, A+C, B+C and A+B+C) . Each 
reaction contained 50mM Tris.HCl, lOmM MgC^, lmM ATP, 
5 3mM DTT, 3/il each of the appropriate annealed 

oligonucleotide pairs, 0.5^x1 (0.5U) T4 DNA ligase and 
Milli-Q water to a total volume of 10/zl. All ingredients 
except ATP, DTT and T4 ligase (Boehringer Mannheim, 
Germany) were mixed and heated to 45 °C for 5 minutes to 

10 denature cohesive termini and then cooled to room 

temperature before addition of the remaining components. 
Ligation reactions were incubated overnight at 16 °C and 
analysed on 2% LMT agarose gels, and ligated blocks were 
purified on preparative 2% LMT agarose gels with TAE 

15 running buffer. Product bands were identified under 
long-wave UV illumination with reference to known DNA 
size -standards (pBSHSK+ digested with either Haelll or 
AluIJ (Statagene, USA) and excised in the minimum 
possible volume of gel. DNA was recovered from LMT 

20 agarose fragments using GELase or agarase. Block 4 was 
constructed similarly, except that four instead of three 
complementary oligonucleotide pairs participated in the 
ligation reaction. Purity and yield of recovered SHLOX 
blocks was assessed by analytical agarose electrophoresis 

25 alongside known DNA standards. 

pBSIISK+ (Stratagene) DNA was digested with 
appropriate restriction enzymes (Figure 2) . Inserts were 
purified at each stage by preparative gel 
electrophoresis. Plasmid DNA was isolated from agarose 

30 gels (Prep-A-Gene, Bio-Rad) . Approximately lOOng (ca. 
0.05pmol) purified plasmid fragment was added to 50ng 
(ca. 0.3pmol) synthetic block in 17/il buffer containing 
50 mM Tris.HCl pH 7.5, lOmM MgCl 2 and the solution heated 
at 45°C for 5 minutes to denature cohesive termini. Upon 

3 5 cooling to room temperature, 2/il lOmM ATP, 30mM DTT and 
Ifil T4 DNA ligase (1U) were added and the reaction 
incubated overnight at 16 °C. The mixture was heated for 
10 minutes at 75°C to inactivate T4 ligase, and DNA was 
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precipitated by adding 2.5/xl 3M sodium acetate and 250/zl 
ethanol to the 20itl volume of the ligation mixture. The 
pellet was resuspended in a final volume of 10 /xl TE, pH 
8.0. Of this solution, 1 to 2 /xl was used to 
5 electroporate 40/xl XLlBlue competent cells (Bio-Rad 
electroporation Gene Pulser and Pulse Controller; 
2500V/0 . 05sec) . Electroporated cells were diluted with 
lml LB media immediately after the procedure, incubated 1 
hour at 37°C and 100/zl of this was spread on agar plates 

10 containing 100/xg/ml ampicillin (LB plus ampicillin) , 

O.lmM IPTG and 80/xg/ml X-gal. Recombinants were selected 
on the basis of a white bacterial colony phenotype, 
although some blue colonies were used as controls. 
Plasmid DNA was screened by restriction digestion and 

15 confirmed by sequencing. Plasmids containing the four 
lysyl oxidase blocks were designated pSHLOXl-4. 

All candidate clones were screened and the mutants 
discarded, unless they contained mutations of potential 
value in future SHLOX studies. Plasmids bearing correct 

20 sequences were grown in E.coli XLlBlue cells (Bullock et 
al 1987) , purified by CsCl - ethidium bromide density 
gradient centrif ugation and confirmed by sequencing in 
both directions. 

In two separate reactions, adjacent pairs of gene 

25 blocks 1+2 and 3+4 were ligated in pBSIISK+. These were 
designed to give recombinants a and &, respectively. 
Each block was excised from the appropriate plasmid and 
purified through agarose gel electrophoresis. 50ng each 
synthetic block and lOOng appropriately restriction 

3 0 enzyme cleaved complementary- tailed pBSIISK+ digest were 
ligated in a total of 20/xl T4 ligase buffer in the 
presence of 1U T4 DNA ligase overnight at 16 °C. 
Transformants were screened by the colony white 
phenotype, comparative electrophoretic mobility of the 

35 intact plasmids in 0.7% TBE agarose gel and by 

restriction digestion. After verification of the primary 
structure by sequencing, pSHLOXa and pSHLOXS were used 
for the final assembly of the pSHLOX gene. 
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pSHLOXa and pSHLOXS plasmids were cut and the 
appropriate fragments containing a and E subassemblies of 
the intact gene of lysyl oxidase were purified from an 
agarose gel and ligated. Transf ormants were selected 
5 from LB plus ampicillin plates containing IPTG and X-gal. 
Correct constructs were confirmed by sequencing. The 
intact 1188bp synthetic gene encoding lysyl oxidase 
(SHLOX) was used for all further manipulations. The 
construct Containing full length SHLOX (Figure 2) in 
10 pBSIISK+ was designated pSHLOXB. 

The construction of SHLOX is exemplary of the 
methods used with other genes . 

Construction of derivatives (including shorter forms) and 
placing in various E. coli vectors for production 

15 1^ PGEXL0X-2T 

The full length SHLOX gene was excised from pSHLOXB 
with BamHI and purified by gel electrophoresis and GELase 
(Epicentre Technologies, USA) . Purified fragment (lOOng) 
was ligated with 50ng pGEX-2T (Medos Co Pty Ltd) in lOfil 

20 total volume of ligase buffer' using 1U of T4 DNA ligase. 
The vector DNA had been previously linearised with BamHI 
and treated with calf intestinal phosphatase (CIP; 
Boehringer Mannheim, Germany) . Transf ormants were 
selected on LB+ampicillin plates, screened by restriction 

25 digestion and confirmed by DNA sequencing. The SHLOX gene 
cloned into pGEX-2T was designated pGEXLOX-2T. 

2^ PGEXLOX 1 " 1 

Sail and Xhol mutations were introduced (Clontech 
Transformer mutagenesis kit) into pGEXL0X-2T at positions 

30 -13 to +23 and 421 to 442 of the SHLOX gene using two 

mutagenic primers (Xhol: CTG GTT CCG CGT GGA TCC CTC GAG 
CAG CCG CCG CGT; Sail: ATG GTT GGT GTC GAC CCG TAC A) 
(Beckman Oligofax Service, Sydney) . The 424 bp DNA 
fragment between the introduced Sail and Xhol sites was 

35 excised from the 5' -terminal region of SHLOX of pGEXLOX- 
2T by cutting with Sail and Xhol and the remaining 
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plasmid DNA was gel-purified. Sail and Xhol compatible 
ends were ligated. Apart from GST, the resulting 
construct pGEXL0X ml was designed such that following 
thrombin cleavage of the expressed protein it would 
5 imitate a 'mature' SHLOX sequence, encoding a 3 0kDa 

protein with similar size and primary structure to the 
natural extracellular form of lysyl oxidase (Cronshaw 
et.al., 1995). Following sequence confirmation and 
further transformation, expression was in E.coli DH5a. 
10 The resulting expressed protein was designated GST -SHLOX. 



p3aL mi and pl2aL mi 

pGEXL0X ml was treated with BamHl to release the 
coding sequence SHL0X ml (the truncated form of SHLOX) . 
This was ligated, in each case, into the BamHI site of 

15 pBT3a and pET12a plasmid vectors (Novagen Inc, USA) and 
the resulting constructs, p3aL and pl2aL m ' were 
transformed into E.coli HMS174 (Studier et al 1990) . 
Transf ormants were plated and screened. Orientation of 
inserts was confirmed by restriction digestion and the 

20 regions around the cloning sites by sequencing. Following 
further transformation, expression was in E.coli BL21/DE3 
(Studier et al 1990) cells for p3aL ml and pl2aL ml . The 
resulting expressed proteins were designated 3aL ml and 
12aL ml , respectively. 

25 iL_ p3DLox 

Full length SHLOX was gel -purified from pSHLOXB 
after BamHI digestion. pET3d vector was linearised with 
BamHI and dephosphorylated with CIP. Ligation of lOOng 
treated vector and 50ng purified SHLOX gene with T4 DNA 

30 ligase was conducted in 20fil volume at 16°C for 20h, and 
the ligation mixture was used to electroporate E.coli 
HMS174 . The recombinant bearing a SHLOX insert in the 
correct orientation was treated with Ncol to release the 
842bp fragment consisting of 39 bp of pET3d and 813bp 
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SHLOX. Following gel purification, the resulting linear 
plasmid was recircularised and cloned in E.coli HMS174. 
Clones were sequenced. The verified construct (designated 
p3DLox) contained the 297bp SHLOX 3 '-terminal region 
5 encoding a 13,653 Da polypeptide (designated Dlox) 
corresponding to a carboxyterminal sequence of human 
lysyl oxidase . 

5^ oSdL 1 " 2 

Sequences of both the 3aL ml and the 12aL ml 

10 recombinant polypeptides were not considered to precisely 
reflect the primary structure of mature human lysyl 
oxidase. Each had N-terminal extensions (of the order of 
14 residues for 3aL ml and 24 residues for 12aL ml ) 
originating as a consequence of cloning and derived from 

15 their vector sequences. These were foreign to mature 
human lysyl oxidase. To minimise the possible 
interference of these extensions on the properties and 
structure of the recombinant mature protein, it was 
considered worthwhile to place the translational start 

20 site as close as possible to aspartate^ of SHLOX. 

Accordingly use was made of an internal methionine^ 
residue fortuitously located just three residues upstream 
of aspartate comprising the proposed N- terminus of the 
processed protein. Although there was no convenient 

25 recognition site in this region, and in particular no 

Ncol or Ndel endonuclease restriction site in this part 
of the SHLOX sequence, an appropriate modification was 
planned by site-directed mutagenesis. The plasmid chosen 
for mutagenesis was pGEXSHLOX. pGEXSHLOX is identical to 

3 0 pGEXL0X-2T. An Ncol primer was designed (CCC GTG TTG ACG 
CCA TGG TTG GTG) which generated two nucleotide changes 
to transform the GGT ATG GTT target sequence into GCC ATG 
GTT. After generating the mutation, the resulting 
construct had two jNcoI sites; the second (downstream) 

35 site was already in the SHLOX sequence. This offered a 
simple clone construction scheme. After screening, the 
mutant clone was used to prepare purified plasmid by 
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CsCl - ethidium bromide density gradient centrifugation. 
From this DNA the 396 bp fragment, flanked by Ncol sites 
at positions 419 to 424 and 815 to 820 of the mutated 
SHLOX, was excised. This fragment was then inserted into 
5 pDLox. The orientation and primary structure of the 

recombinant was confirmed by sequencing. The resulting 
construct, designated p3dL m2 , was electroporated into 
E.coli BL21/DE3. The inventor recognises that the Asp 
144 form can be generated by, for example, treatment of 
10 any of the pGEXLOXml or p3aLml or pl2aLml forms with an 
appropriate protease, for example procollagen C- 
proteinase . 

6 . Yeast constructs 

Various constructs were made for expression in 

15 yeast, including pCNNinv (Carlsberg Research 

Laboratories) and pYEX-BX { AMRAD) . Pichia was chosen as 
an alternative expression host. For pCNNinv, a form of 
the synthetic gene corresponding to mature secreted lysyl 
oxidase was generated by PCR from pl2aL ml , using primers 

20 which included at their 5' termini Eagl sites. The 

amplification product was purified from an agarose gel 
using agarase, ligated to pGEM-T and transformed into E. 
coli XLl-Blue cells. Nine colonies from 16 candidate 
clones generated DNA that upon cutting with EagI 

25 contained bands of approx. 750 and 3004 bp. Two colonies 
were selected for further investigation. Manual 
sequencing of one of the two recombinants showed a 
deletion. The other clone was correct over the sequenced 
region. A Qiagen (GmbH, Germany) preparation of DNA from 

3 0 this clone was cut with EagI in parallel with pCNNinv. 
Both forms of DNA were purified using agarase. The cut 
pCNNinv was treated with calf intestinal alkaline 
phosphatase and purified by phenol/chloroform extraction. 
The vector and insert were ligated and transformed into 

3 5 E. coli XLl-Blue cells. Twenty colonies were screened 

using Hindlll, and two colonies appeared to have inserts 
in the correct orientation. Spheroplasts of 
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Sac char omyces cerevisiae strain DBY74 7 (AMRAD) were 
transformed separately with the two constructs, and one 
of these was subsequently chosen for detailed analysis. 
Complete DNA sequencing confirmed the correct sequence. 
5 For cloning into pYEX-BX, the vector was cleaved 

with flamHl and dephosphorylated using calf intestinal 
alkaline phosphatase. Separately, pGEXLOX-2T was treated 
with restriction enzymes NotI and BamHI and the SHLOX 
fragment was purified from an agarose gel. This fragment 

10 was ligated to the linearised vector, after which ends 
were filled using deoxynucleoside triphosphates and 
Sequenase v2 . 0 (Amersham) . Blunt end ligation was then 
carried out. Correct constructs were identified by 
restriction enzyme digestion followed by complete 

15 sequencing of the fragment and adjacent vector sequences. 
All restriction enzymes and agarase used in this 
study were obtained from Boehringer Mannheim, Germany. 

Preparati on of various soluble proteins 

Growth of E. coli cultures for recombinant lvsvl oxidase 

20 preparation 

A single colony from an agar plate was grown 
overnight with gyrotary shaking in 3ml LB + ampicillin 
media, and the entire culture was used to inoculate 250ml 
2TY medium containing 70figmr l ampicillin in a 1L conical 

25 flask. The inoculated medium was grown at 37°C with 

vigorous shaking (280-290rpm) until OD^qq "0.6-0.8 (about 
2h) upon which expression was induced by the addition of 
IPTG (usually 0.4mM IPTG for DLox, 3aL ml and 3dL m2 ; O.lmM 
IPTG for GST- SHLOX) . Prior to induction a 0.5ml aliquot 

30 of the cell culture was withdrawn for analysis by SDS- 
PAGE and to confirm the absence of inclusion bodies by 
phase contrast microscopy. The induced cell culture was 
incubated under the same conditions for a further 2.5 to 
3 hours then transferred to 4°C for 30 minutes before 

35 harvesting. At this point a 4pl aliquot was taken for 

analysis by phase contrast microscopy for the presence of 
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inclusion bodies. If inclusion bodies were present, this 
was considered a reliable indicator of successful 
expression. Otherwise expression was monitored by SDS- 
PAGE of total bacterial protein alongside a sample 
5 collected before induction. 

The culture (250 ml, 4°C) was divided into 50 ml 
aliquots in polypropylene tubes and cells were harvested 
by centrif ugation (10 minutes, 5,000 rpm, 4°C) . The 
supernatant was thoroughly removed and the wet pellets 
10 were weighed. A typical yield was 0.5 to lg packed wet 
cells per 50 ml culture. Pellets were either stored at 
-80°C for several months, or used immediately for 
inclusion body purification. 
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Purification of inclusion bodies 

Thawed pellets were thoroughly resuspended on ice in 
"10 vol/weight of lysis buffer (50mM TrisHCL pH8.0 # lmM 
EDTA, lOOitiM NaCl (usually 5 to 7 ml per tube) and lysed 
5 at 4°C with lmg/ml lysozyme (added in the form of a 10- 
times solution in lysis buffer) for 30 minutes with 
constant mixing. Mixing was achieved by continuous 
mechanical inversion on a vertical platform at 30 to 40 
rpm. To inhibit putative subsequent protease activity 

10 PMSF was then added to 0.5 mM. Quantitative cell 

disruption was achieved by incubation with 1% Triton-XlOO 
(4°C, constant mixing, 30 minutes) . Cell lysates showed 
high viscosity because of released bacterial chromosomal 
DNA. To reduce this effect DNA was digested until visual 

15 loss of viscosity with 0 . lmg/ml DNase in the presence of 
lOmM MgCl 2 with constant mixing (typically 20 to 30 
minutes at 4°C) . Viscosity was further reduced by 
sonication using three bursts of 10 seconds each in tubes 
immersed in ice. The extent of lysis was checked by 

20 phase contrast microscopy and if necessary the lysis time 
was extended and/or further sonication applied. 

After centrifugation (typically 5,000 rpm, 0° to 
4°C, 2 0 minutes) the supernatant was carefully decanted 
and residual drops of supernatant were removed with a 

25 tissue, taking care not to disturb the pellet. Ice-cold 
lysis buffer (volume equal to that of the the discarded 
supernatant) was mixed with the pellet containing 
inclusion bodies, and the pellet was dispersed either by 
repeated passage through a blue Gilson Pipetman or by 5 

3 0 to 6 seconds sonication with a microtip at nearly maximal 
setting . 

A second centrifugation was similarly performed and 
the supernatant was discarded. The washed pellet was 
resuspended in an equal volume of ice-cold 2M urea in 
35 lOmM potassium phosphate buffer, pH 8.2, and incubated 
for 10 minutes with constant mixing at 4°C, followed by 
centrifugation at 5,000 rpm, 4°C, 20 minutes to collect 
purified inclusion body fraction in the separated pellet. 
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This last step usually extracted only a minor fraction of 
protein from inclusion bodies as judged by SDS-PAGE. The 
pellet was kept cool until solubilisation, and never 
frozen. Storage of the inclusion body fraction was 
5 {unless frozen) in the presence of 6M urea at 4°C. 

Solubilisation of inclusion body vrotein 

Solubilisation was achieved immediately after 
purification using ice-cold U6KP (6M urea in 10 mM 
potassium phosphate buffer, pH B.2). The solution was 

10 autoclaved and stored at room temperature prior to use. 

The amount of U6KP used for solubilisation determined the 
eventual concentration of recombinant lysyl oxidase. 
Typically 5 ml U6KP was used to solubilise the pellet 
derived from 50 ml bacterial culture, to yield 2 to 

15 3mg/ml total protein. Mercaptoethanol was added to 2 to 
3 mM. To help solubilisation mechanical disruption with 
a plastic Gilson tip was initially tried but this often 
led to loss of material which adhered to the inside and 
outside of the tip. Instead, the mixture was usually 

20 mechanically mixed by continuous mechanical inversion on 
a vertical platform at 30 to 40 rpm overnight, or until 
there was no sign of clumps. At all times the 
temperature was kept ~0° to 4°C. 

The total protein concentration was estimated by 

25 Bradford assay, and CuCl 2 was added at "5 to 10 times 
molar excess over estimated protein. This was usually 
200/iM CuCl 2 . Solubilised protein stocks were used 
directly for protein analysis or for refolding of the 
protein by dialysis in various buffers, including 

30 phosphate buffers, depending on the nature of the 
experiment . 

Refolding of inclusion hndi es bv dialysis against 
Dhosnhat.R huffer 

For dialysis against phosphate buffer 10 mm and 32 
35 mm dialysis tubing (Selby, Australia) was usually used. 
Cut tubing was boiled in MilliQ water for 10 minutes and 
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briefly soaked in dialysis buffer prior to use. lOmM or 
16 mM phosphate buffer was prepared from 1M KH 2 P0 4 and 1M 
K2HPO4 solution stocks and chemical grade distilled water, 
with the occasionally pH adjusted with 1M KOH. 
5 Dialysis of 2 or 3 ml of a 6M urea solution of 

recombinant protein (from 2 or 3 tubes, respectively) was 
routinely performed against 2 L urea-free buffer, with 
buffer changes after each dialysis. Copper-containing 
samples were first dialysed against buffer containing an 

10 approximately equimolar concentration of CuCl 2 (usually 

10-50 fiM) , in the expectation that removal of urea would 
be accompanied by integration of copper and bias towards 
a conf ormationally relevant form of the protein. Excess 
of copper was then removed in the following dialysis. 

15 After two consecutive dialyses (one against a 

copper-containing buffer, and the other against a 
copper-free buffer at pH 9.6) the dialysed material was 
transferred into polypropylene tubes on ice. Soluble and 
insoluble fractions were separated by centrifugation and 

20 analysed by SDS-PAGE. The pellet and the supernatant 

obtained after dialysis was stored at 4°C without obvious 
protein degradation for at least 3 weeks, but some 
solubility changes were noticed during storage. 

Protein from S. cerevisiae cultures 

25 To obtain lysyl oxidase in yeast cell pellets and 

lysates, a colony was grown in CSM-leu for 4 8 hours. To 
purify secreted protein, cells were separated from growth 
media by centrifugation, and the supernatant was dialysed 
against ice-cold U6KP. Cells were washed in 5mM KPB 

30 (retained) then resuspended in 5mM KPB plus lyticase. 

After incubation for 1 hour at 30°C, cells were sonicated 
until apparent lysis as detected by phase contrast 
microscopy. The pellet and lysate were separated by 
centrifugation. The pellet and lysates were washed with 

3 5 U6KP and dialysed separately against ice-cold U6KP then 
against KPB, pH 9.6. 



SUBSTITUTE SHEET (RULE 26) 



WO 98/06830 



PCT/AU97/00505 



- 48 - 



10 



PROTEIN RTTmTKG 

To confirm the identity of the expressed protein, in 
each case DNA sequences were determined. Purified soluble 
recombinant lysyl oxidase forms were all examined by 
SDS-PAGE, electrospray mass spectrometry, amino acid 
analysis and N- terminal protein sequencing to confirm the 
identity of the overexpressed protein. Detailed analyses 
of full-length protein demonstrated the following: 
(1) the protein bound copper at ~ l mole Cu 2+ per mole 
protein; this assay was performed by atomic absorption 
spectroscopy on a Varian SpectrAA-10. Protein samples 
were examined for protein content using a Bradford assay 
at 595 nm, and the ratio molar content of copper to 
protein was determined. 
15 (2) the protein exists as multimers in solution; multimer 
formation was observed by column chromatography in 
experiments using either Sephadex G100 or G200, compared 
with protein standards. Aliquots were examined 
spectrophotometrically at 280nm. Light scattering 
studies (Malvern Instruments S4700 version PCS v 1.26) 
showed size distributions, including approximately 
spherical particles at 30 to 56 nm. 
(3) multiple peak formation by ion-exchange 
chromatography; multiple peaks were observed for 3dLm2 
25 when analysed on a 100 nm x 4.6 mm HperD Q column on a 
BioSys2000 system (Beckman, USA) , using a linear NaCl 
gradient from ) to 0.4 M NaCl, over 9 min in the presence 
of 6M urea and 16 mM potassium phoshpate buffer, at 
280nm. 

3 0 (4) organic cof actor formation as evidenced by 

redox-cycling (Paz et ai . , 1991), covalent radiolabel 
attachment of 14-C-ethylenediamine, and Raman 
spectroscopy of the p-nitrophenylhydrazone 
(5) enzyme activity including conversion of recombinant 

35 tropoelastin to aqueous- insoluble material. 

In the yeast expression product studied 
characteristics 2 and 5 have been verified indicating 
that functional protein is being produced while all 5 
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characteristics have been verified for E.coli expression 
particularly with respect to the pSdL 1 " 2 construct. 

Levels of expression obtained have been 
approximately 30% of total cell protein in E. coli and 
5 10% of total cell protein in yeast. 

Benzylamine substrate assay 

The assay was performed according to the method of 
Trackman et al . 1919. The assay monitors at 250nm the 
production of benzylaldehyde from benzylamine in 
10 potassium phosphate buffer at 37°C. Lysyl oxidase 

activity is measured spectrophotometrically at 37°C by 
the conversion of benzylamine to benzaldehyde at 2 5 0nm. 

Three samples containing lysyl oxidase and benzylamine 
showed a slight increase in absorbance at 250nm. In this 
15 assay the lysyl oxidase was derived from the plasmid 
pSdL 1 "'' . Controls, including boiled lysyl oxidase and 
benzylamine, benzylamine and dialysis buffer (no enzyme) 
and lysyl oxidase and KPB (no substrate) showed no 
increase in absorbance at 250nm. 

Averaging the change in absorbance at 250nm of the three 
lysyl oxidase and benzylamine samples and estimating the 
concentration of lysyl oxidase in each assay the specific 
activity of the enzyme was calculated at 0.0145 
umol/min/mg. Wang et al . reports a specific activity of 
0.019 U/mg. 

Tropoelastin substrate assay 

Tropoelastin resuspended in either KPB (10mM,pH8) or PBS 
(lOmM PB l50mM NaCl) was mixed at a final concentration 
of lOmg/mL, with lysyl oxidase and placed at 37°C for 2 
hrs. In this assay, the lysyl oxidase was derived from 
the pSdL 1 " 2 plasmid. Controls included tropoelastin + 
lysyl oxidase + BAPN, tropoelastin + boiled lysyl 
oxidase, tropoelastion + dialysis buffer, tropoelastin + 
buffer. 



20 
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Equal amounts from each reaction were run on an 8% 
SDS-PAGE. On staining with Coomassie less intense 
tropoelastin bands were seen in the tropoelastin in KPB + 
LO and tropoelastin in KPB + lysyl oxidase + BAPN sample 
5 lanes compared to other lanes. This may be indicative of 
the presence of crosslinked tropoelastin which is too 
large to enter the gel. Whilst BAPN should inhibit lysyl 
oxidase activity, preincubation of the enzyme with BAPN 
for at least 2 hours at 37°C may be required prior to 

10 assaying in order for the inhibition to be effective. 

The above 2 samples and the tropoelastin + buffer 
control were centrifuged, the supernatant removed and the 
precipitate sent for amino acid analysis. The 2 samples 
showed a decrease in percentage of total lysine compared 

15 to the control. Tropoelastin + buffer contained 6% 
lysine, tropoelastin + lysyl oxidase 2.5% lysine and 
tropoelastin + lysyl oxidase + BAPN 2.2% lysine. This is 
consistent with the removal of lysine residues to form 
cross -links such as desmosines. Bedell- Hogan et al 

20 reports the presence of 5.6% lysine residues in 

recombinant tropoelastin compared to 2.4% in alkali 
insoluble recombinant tropoelastin. 

In summary, the results show a lowering of the 
percentage of lysine in tropoelastin, as determined by 

25 the amino acid analysis. Furthermore, an insoluble 

protein was produced, consistent with the production of 
an elastin or elastin-like product. 

Tritiated t ropoelastin assay 

Recombinant tropoelastin was prepared essentially as 

30 described Martin et al . 1995, except that BL21 cells 

containing pSHELF (WO 94/14958) were grown in "complete 
synthetic medium without lysine" (CSM-Lys) , and 
supplemented with L- [4 , 5-tritium] lysine monochloride 
(Amersham) at 83 Ci/mmol. Accordingly, the resulting 

35 tropoelastin was radiolabelled to approximately 

40,000,000 cpm/mg protein. This was lyophilised, then 
redissolved in assay buffer (0.1M Na 2 B 4 0 7 , 0.15M NaCl, 
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pH8) and centrifuged through Millipore Ultrafree MC for 1 
hour at 7500 rpm at 4°C. The retentate was used for 
subsequent assays . Assays were then performed by 
incubation with lysyl oxidase, and assessed according to 
5 the protocol of Shackelton and Hulmes, 1990. In this 
assay, the lysyl oxidase was derived from the p3dL m2 
plasmid. Tritiated tropoelastin was mixed with lysyl 
oxidase and left at 37°C for 16 hours. Tritiated water 
formed during the incubation, an indication of lysyl 

10 oxidase activity, was isolated by ultrafiltration and 
radioactivity quantified by liquid scintillation 
spectrometry. A control for endogenous lysyl oxidase - 
like activity in E. coli was generated by taking an E. 
coli strain containing the pET3d plasmid through the same 

15 purification procedure as that used to produce lysyl 

oxidase. A sample of this pET3d control was also mixed 
with tritiated tropoelastin. Preliminary results are 
consistent with some lysyl oxidase -like activity. In all 
cases samples containing lysyl oxidase produced more 

20 tritiated water than those containing pET3d (Figure 5) . 

INDUSTRIAL APPLICABILITY 

The genetic constructs of the present invention are 
useful in the preparation of high yields of biocompatible 
biospecific lysyl oxidase useful in vivo applications and 

25 other situations where catalysis of amine oxidation is 

required. The expression products of the invention are 
useful as agents for crosslinking of (but not confined 
to) tropoelastin and collagen both in vitro and in vivo, 
to join primary- amine materials e.g. in surgical or 

30 veterinary applications including natural and artificial 
materials, regulating cell growth, interfering 
with/promoting fibrosis, oxidising accessible lysines 
(and other amines) to modify behaviour of those molecular 
components and as components of assay systems designed to 

3 5 detect/quantify amines. Further, the genetic constructs 
and expression products of the invention have potential 
in cancer therapy. 
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(A) LENGTH: 1251 base pairs 

(B) TYPE: nucleic acid 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
ATGCGCTTCG CCTGGACCGT GCTCCTGCTC GGGCCTTTGC AGCTCTGCGC GCTAGTGCAC 
TGCGCCCCTC CCGCCGCCGG CCAACAGCAG CCCCCGCGCG AGCCGCCGGC GGCTCCGGGC 
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GCCTGGCGCC 


AGCAGATCCA 


ATGGGAGAAC 


AACGGGCAGG 


TGTTCAGCTT 


GCTGAGCCTG 


180 


GGCTCACAGT 


ACCAGCCTCA 


GCGCCGCCGG 


GACCCGGGCG 


CCGCCGTCCC 


TGGTGCAGCC 


240 


AACGCCTCCG 


CCCAGCAGCC 


CCGCACTCCG 


ATCCTGCTGA 


TCCGCGACAA 


CCGCACCGCC 


300 


GCGGCGCGAA 


CGCGGACGGC 


CGGCTCATCT 


GGAGTCACCG 


CTGGCCGCCC 


CAGGCCCACC 


360 


GCCCGTCACT 


GGTTCCAAGC 


TGGCTACTCG 


ACATCTAGAG 


CCCGCGAACG 


TGGCGCCTCG 


420 


CGCGCGGAGA 


ACCAGACAGC 


GCCGGGAGAA 


GTTCCTGCGC 


TCAGTAACCT 


GCGGCCGCCC 


480 


AGCCGCGTGG 


ACGGCATGGT 


GGGCGACGAC 


CCTTACAACC 


CCTACAAGTA 


CTCTGACGAC 


540 


AACCCTTATT 


ACAACTACTA 


CGATACTTAT 


GAAAGGCCCA 


GACCTGGGGG 


CAGGTACCGG 


600 


CCCGGATACG 


GCACTGGCTA 


CTTCCAGTAC 


GGTCTCCCAG 


ACCTGGTGGC 


CGACCCCTAC 


660 


TACATCCAGG 


CGTCCACGTA 


CGTGCAGAAG 


ATGTCCATGT 


ACAACCTGAG 


ATGCGCGGCG 


720 


GAGGAAAACT 


GTCTGGCCAG 


TACAGCATAC 


AGGGCAGATG 


TCAGAGATTA 


TGATCACAGG 


780 


GTGCTGCTCA 


GATTTCCCCA 


AAGAGTGAAA. 


AACCAAGGGA 


CATCAGATTT 


CTTACCCAGC 


840 


CGACCAAGAT 


ATTCCTGGGA 


ATGGCACAGT 


TGTCATCAAC 


ATTACCACAG 


TATGGATGAG 


900 


TTTAGCCACT 


ATGACCTGCT 


TGATGCCAAC 


ACCCAGAGGA 


GAGTGGCTGA 


AGGCCACAAA 


960 


GCAAGTTTCT 


GTCTTGAAGA 


CACATCCTGT 


GACTATGGCT 


ACCACAGGCG 


ATTTGCATGT 


1020 


ACTGCACACA 


CACAGGGATT 


GAGTCCTGGC 


TGTTATGATA 


CCTATGGTGC 


AGACATAGAC 


1080 


TGCCAGTGGA 


TTGATATTAC 


AGATGTAAAA 


CCTGGAAACT 


ATATCCTAAA 


GGTCAGTGTA 


1140 


AACCCCAGCT 


ACCTGGTTCC 


TGAATCTGAC 


TATACCAACA 


ATGTTGTGCG 


CTGTGACATT 


1200 


CGCTACACAG 


GACATCATGC 


GTATGCCTCA 


GGCTGCACAA 


TTTCACCGTA 


T 


1251 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 417 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Arg Phe Ala Trp Thr Val Leu Leu Leu Gly Pro Leu Gin Leu Cys 
1 5 10 15 

Ala Leu Val His Cys Ala Pro Pro Ala Ala Gly Gin Gin Gin Pro Pro 
20 25 30 

Arg Glu Pro Pro Ala Ala Pro Gly Ala Trp Arg Gin Gin lie Gin Trp 
35 40 45 

Glu Asn Asn Gly Gin Val Phe Ser Leu Leu Ser Leu Gly Ser Gin Tyr 
50 55 60 
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Gin Pro Gin Arg Arg Arg Asp Pro Gly Ala Ala Val Pro Gly Ala Ala 
65 70 75 80 

Asn Ala Ser Ala Gin Gin Pro Arg Thr Pro lie Leu Leu lie Arg Asp 
85 90 95 

Asn Arg Thr Ala Ala Ala Arg Thr Arg Thr Ala Gly Ser Ser Gly Val 
100 105 no 

Thr Ala Gly Arg Pro Arg Pro Thr Ala Arg His Trp Phe Gin Ala Gly 
115 120 125 

Tyr Ser Thr Ser Arg Ala Arg Glu Arg Gly Ala Ser Arg Ala Glu Abh 
130 135 140 

Gin Thr Ala Pro Gly Glu Val Pro Ala Leu Ser Asn Leu Arg Pro Pro 
145 150 155 " 160 

Ser Arg Val Asp Gly Met Val Gly Asp Asp Pro Tyr Asn Pro Tyr Lys 
165 170 175 

Tyr Ser Asp Asp Asn Pro Tyr Tyr Asn Tyr Tyr Asp Thr Tyr Glu Arg 
180 185 190 

Pro Arg Pro Gly Gly Arg Tyr Arg Pro Gly Tyr Gly Thr Gly Tyr Phe 
195 200 205 

Gin Tyr Gly Leu Pro Asp Leu Val Ala Asp Pro Tyr Tyr lie Gin Ala 
210 215 220 

Ser Thr Tyr Val Gin Lys Met Ser Met Tyr Asn Leu Arg Cys Ala Ala 
225 230 235 240 

Glu Glu Asn Cys Leu Ala Ser Thr Ala Tyr Arg Ala Asp Val Arg Asp 
245 250 ~ 255 

Tyr Asp His Arg val Leu Leu Arg Phe Pro Gin Arg Val Lys Asn Gin 
260 265 270 

Gly Thr Ser Asp Phe Leu Pro Ser Arg Pro Arg Tyr Ser Trp Glu Trp 
275 280 285 

His Ser Cys His Gin His Tyr His Ser Met Asp Glu Phe Ser His Tyr 
290 295 300 

Asp Leu Leu Asp Ala Asn Thr Gin Arg Arg Val Ala Glu Gly His Lys 
305 310 315 320 

Ala Ser Phe Cys Leu Glu Asp Thr Ser Cys Asp Tyr Gly Tyr His Arg 
325 330 335 

Arg Phe Ala Cys Thr Ala His Thr Gin Gly Leu Ser Pro Gly Cys Tyr 
340 345 350 

Asp Thr Tyr Gly Ala Asp lie Asp Cys Gin Trp He Asp He Thr Asp 
355 360 * 365 

Val Lys Pro Gly Asn Tyr He Leu Lys Val Ser Val Asn Pro Ser Tyr 
370 375 380 

Leu Val Pro Glu Ser Asp Tyr Thr Asn Asn Val Val Arg Cys Asp He 
385 390 395 400 

Arg Tyr Thr Gly His His Ala Tyr Ala Ser Gly Cys Thr He Ser Pro 
405 410 415 

Tyr 
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(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1184 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "nucleic acid" 

(iii) HYPOTHETICAL: YES 

(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 



GATCCCAGCA 


GCAGCCGCCG 


CGTGAACCGC 


CGGCTGCTCC 


GGGTGCTTGG 


CGTCAGCAGA 


60 


TCCAGTGGGA AAACAACGGT 


CAGGTTTTCT 


CCCTGCTGTC 


CCTGGGTTCC 


CAGTACCAGC 


120 


CGCAGCGTCG 


TCGTGACCCG 


GGTGCTGCTG 


TTCCGGGTGC 


TGCTAACGCT 


TCCGCTCAGC 


180 


AGCCGCGTAC 


LLUUAl V- X. VJ 


CTGATCCGTG 


ACAAC CGTAC 


CGCGGCCGCT 


CGTACCCGTA 


240 


V_ LUU X \J\J 1 1L 


CTCCGGTGTT 


ACTGCAGGTC 


GTCCGCGTCC 


GACCGCGCGC 


CACTGGTTCC 


300 


AGGCTGGTTA 


CTCCACCTCC 


CGTGCTCGTG 


AAGCTGGTGC 


TTCCCGTGCT 


GAAAACCAGA 


360 


CCGCTCCGGG 


TGAAGTTCCA 


GCGCTGTCCA ACCTGCGTCC 


GCCGTCCCGT 


GTTGACGGTA 


420 


TGGTTGGTGA 


CGACCCGTAC 


AACCCGTACA AGTACTCCGA 


CGACAACCCG 


TACTACAACT 


480 


ACTACGACAC 


CTACGAGCGC 


CCGCGTCCGG 


GTGGTCGTTA 


CCGTCCGGGT 


TACGGTACCG 


540 


GTTACTTCCA 


GTACGGTCTG 


CCGGACCTGG 


TTGCTGACCC 


GTACTACATC 


CAGGCTTCCA 


600 


CCTACGTTCA 


GAAAATGTCC 


ATGTACAACC 


TGCGTTGCGC 


TGCTGAAGAA 


AACTGCCTGG 


660 


CTTCCACCGC 


TTACCGTGCT 


GACGTTCGTG 


ACTACGACCA 


CCGTGTTCTG 


CTGCGTTTCC 


720 


CGCAGCGTGT 


TAAAAACCAG 


GGCACCTCCG 


ACTTCCTGCC 


GTCCCGTCCG 


CGTTACTCCT 


780 


GGGAATGGCA 


CTCCTGCCAC 


CAGCACTACC 


ACTCCATGGA 


CGAATTCTCC 


CACTACGACC 


84 0 


TGCTGGACGC 


TAACACCCAG 


CGTCGTGTTG 


CTGAAGGTCA 


CAAAGCTTCC 


TTCTGCCTGG 


900 


AAGACACCTC 


CTGCGACTAC 


GGTTACCACC 


GTCGTTTCGC 


TTGCACCGCT 


CACACCCAGG 


960 


GTCTGTCCCC 


GGGTTGCTAC 


GACACCTACG 


GTGCTGACAT 


CGACTGCCAG 


TGGATCGACA 


1020 


TCACCGACGT 


TAAACCGGGT 


AACTACATCC 


TGAAAGTTTC 


CGTTAACCCG 


TCCTACCTGG 


1080 


TTCCGGAATC 


CGACTACACC 


AACAACGTGG 


TTCGTTGCGA TATCCGTTAC ACCGGTCACC 


1140 


ACGCTTACGC 


TTCCGGTTGC ACCATCTCCC 


CGTACTAATG 


ATAG 




1184 


(2) INFORMATION FOR SEQ ID NO: 4: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 391 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(iii) HYPOTHETICAL: YES 
(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Ser Gin Gin Gin Pro Pro Arg Glu Pro Pro Ala Ala Pro Gly Ala Trp 
15 10 15 

Arg Gin Gin lie Gin Trp Glu Asn Asn Gly Gin Val Phe Ser Leu Leu 
20 25 30 

Ser Leu Gly Ser Gin Tyr Gin Pro Gin Arg Arg Arg Asp Pro Gly Ala 
35 40 45 

Ala Val Pro Gly Ala Ala Asn Ala Ser Ala Gin Gin Pro Arg Thr Pro 
50 55 50 

lie Leu Leu lie Arg Asp Asn Arg Thr Ala Ala Ala Arg Thr Arg Thr 
65 70 75 80 

Ala Gly Ser Ser Gly Val Thr Ala Gly Arg Pro Arg Pro Thr Ala Arg 
85 90 95 

His Trp Phe Gin Ala Gly Tyr Ser Thr Ser Arg Ala Arg Glu Ala Gly 
100 105 110 

Ala Ser Arg Ala Glu Asn Gin Thr Ala Pro Gly Glu Val Pro Ala Leu 
115 120 * 125 

Ser Asn Leu Arg Pro Pro Ser Arg Val Asp Gly Met Val Gly Asp Asp 
130 135 140 

Pro Tyr Asn Pro Tyr Lys Tyr Ser Asp Asp Asn Pro Tyr Tyr Asn Tyr 
145 150 155 * 160 

Tyr Asp Thr Tyr Glu Arg Pro Arg Pro Gly Gly Arg Tyr Arg Pro Gly 
165 170 175 

Tyr Gly Thr Gly Tyr Phe Gin Tyr Gly Leu Pro Asp Leu Val Ala Asp 
180 185 190 

Pro Tyr Tyr lie Gin Ala Ser Thr Tyr Val Gin Lys Met Ser Met Tyr 
195 200 205 

Asn Leu Arg Cys Ala Ala Glu Glu Asn Cys Leu Ala Ser Thr Ala Tyr 
210 215 220 

Arg Ala Asp Val Arg Asp Tyr Asp His Arg Val Leu Leu Arg Phe Pro 
225 230 235 240 

Gin Arg Val Lys Asn Gin Gly Thr Ser Asp Phe Leu Pro Ser Arg Pro 
245 250 255 

Arg Tyr Ser Trp Glu Trp His Ser Cys His Gin His Tyr His Ser Met 
260 265 270 

Asp Glu Phe Ser His Tyr Asp Leu Leu Asp Ala Asn Thr Gin Arg Arg 
275 280 285 

Val Ala Glu Gly His Lys Ala Ser Phe CyB Leu Glu Asp Thr Ser Cys 
290 295 300 

Asp Tyr Gly Tyr His Arg Arg Phe Ala Cys Thr Ala His Thr Gin Gly 
305 310 315 320 
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Leu Ser Pro Gly Cys Tyr Asp Thr Tyr Gly Ala Asp He Asp Cys Gin 
325 330 335 

Trp He Asp lie Thr Asp Val Lys Pro Gly Asn Tyr He Leu Lys Val 
340 345 * 350 

Ser Val Asn Pro Ser Tyr Leu Val Pro Glu Ser Asp Tyr Thr Asn Asn 
355 360 365 

Val Val Arg Cys Asp He Arg Tyr Thr Gly His His Ala Tyr Ala Ser 
370 375 380 

Gly Cys Thr He Ser Pro Tyr 
385 390 

(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(iv) ANTI- SENSE: YES 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
GATCCCAGCA GCAGCCGCCG CGTGAACCGC CGGCTGCTCC GGGTGCTTGG CGTCAGCAGA 
TCCAGTGGGA AAACAACGGT CAGGTTTTCT CCCTG 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 
(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CTGTCCCTGG GTTCCCAGTA CCAGCCGCAG CGTCGTCGTG ACCCGGGTGC TGCTGTTCCG 
GGTGCTGCTA ACGCTTCCGC TCAGC 
(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(iv) ANTI- SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
AGCCGCGTAC CCCGATCCTG CTGATCCGTG ACAACCGTAC CGCGGCCGCT CGTACCCGTA 
CCGCTGGTTC CTCCGGTGTT ACTGCA 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
GGTCGTCCGC GTCCGACCGC GCGCCACTGG TTCCAGGCTG GTTACTCCAC CTCCCGTGCT 
CGTGAAGCTG GTGCTTCCCG TGCTGAAAAC CAG 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(iv) ANTI - SENSE : YES 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:9: 
ACCGCTCCGG GTGAAGTTCC AGCGCTGTCC AACCTGCGTC CGCCGTCCCG TGTTGACGGT 
ATGGTTGGTG ACGACCCGTA CAACCCGTAC A 
(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 



60 
91 
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(A) LENGTH: 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
AGTACTCCGA CGACAACCCG TACTACAACT ACTACGACAC CTACGAGCGC CCGCGTCCGG 60 
GTGGTCGTTA CCGTCCGGGT TACGGTAC 88 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:ll: 
CGGTTACTTC CAGTACGGTC TGCCGGACCT GGTTGCTGAC CCGTACTACA TCCAGGCTTC 60 
CACCTACGTT CAGAAAATGT CCATGTAC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
AACCTGCGTT GCGCTGCTGA AGAAAACTGC CTGGCTTCCA CCGCTTACCG TGCTGACGTT 
CGTGACTACG ACCACCGTGT TCTGCTGCGT TTCCCGC 



60 
97 
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(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
AGCGTGTTAA AAACCAGGGC ACCTCCGACT TCCTGCCGTC CCGTCCGCGT TACTCCTGGG 60 
AATGGCACTC CTGCCACCAG CACTACCACT CCATGGACG 99 
(2) INFORMATION FOR SEQ ID NO : 14 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
AATTCTCCCA CTACGACCTG CTGGACGCTA ACACCCAGCG TCGTGTTGCT GAAGGTCACA 60 
AAGCTTCCTT CTGCCTGGAA GACACCTCCT GCGACTACG 99 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GTTACCACCG TCGTTTCGCT TGCACCGCTC ACACCCAGGG TCTGTCCCCG GGTTGCTACG 60 



SUBSTITUTE SHEET (RULE 26) 



WO 98/06830 



- 67 - 



PCT/AU97/00505 



ACACCTACGG TGCTGACATC GACTGCCAGT GGA 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TCGACATCAC CGACGTTAAA CCGGGTAACT ACATCCTGAA AGTTTCCGTT AACCCGTCCT 
ACCTGGTTCC GGAATCCGAC TACACCAACA ACG 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 77 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TGGTTCGTTG CGATATCCGT TACAC CGGTC ACCACGCTTA CGCTTCCGGT TGCACCATCT 
CCCCGTACTA ATGATAG 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 
(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
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GAAAACCTGA CCGTTGTTTT CCCACTGGAT CTGCTGACGC CAAGCACCCG GAGCAGCCGG 

CGGTTCACGC GGCGGCTGCT GCTGG 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 85 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 
(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CGGAAGCGTT AGCAGCACCC GGAACAGCAG CACCCGGGTC ACGACGACGC TGCGGCTGGT 
ACTGGGAACC CAGGGACAGC AGGGA 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GTAACACCGG AGGAACCAGC GGTACGGGTA CGAGCGGCCG CGGTACGGTT GTCACGGATC 
AGCAGGATCG GGGTACGCGG CTGCTGAG 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 
(iv) ANTI -SENSE: YES 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TTCAGCACGG GAAGCACCAG CTTCACGAGC ACGGGAGGTG GAGTAACCAG CCTGGAACCA 
GTGGCGCGCG GTCGGACGCG GACGACCTGC A 
(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 
GTTGTACGGG TCGTCACCAA CCATACCGTC AACACGGGAC GGCGGACGCA GGTTGGACAG 
CGCTGGAACT TCACCCGGAG CGGTCTGGTT 
(2) INFORMATION FOR SEQ ID NO:23: 

(A) LENGTH: 91 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL : YES 

(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CGTAACCCGG ACGGTAACGA CCACCCGGAC GCGGGCGCTC GTAGGTGTCG TAGTAGTTGT 
AGTACGGGTT GTCGTCGGAG TACTTGTACG G 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 98 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL : YES 
(iv) ANTI -SENSE: YES 



SUBSTITUTE SHEET (RULE 26) 



WO 98/06830 



- 70 



PCT/AU97/00505 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
GCAGGTTGTA CATGGACATT TTCTGAACGT AGGTGGAAGC CTGGATGTAG TACGGGTCAG 
CAACCAGGTC CGGCAGACCG TACTGGAAGT ACCGGTAC 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
ACACGCTGCG GGAAACGCAG CAGAACACGG TGGTCGTAGT CACGAACGTC AGCACGGTAA 
GCGGTGGAAG CCAGGCAGTT TTCTTCAGCA GCGCAAC 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 96 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: YES 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 26: 
AATTCGTCCA TGGAGTGGTA GTGCTGGTGG CAGGAGTGCC ATTCCCAGGA GTAACGCGGA 
CGGGACGGCA GGAAGTCGGA GGTGCCCTGG TTTTTA 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: YES 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GCAGGAGGTG TCTTCCAGGC AGAAGGAAGC TTTGTGACCT TCAGCAACAC GACGCTGGGT 60 
GTTAGCGTCC AGCAGGTCGT AGTGGGAG B8 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(iv) ANTI- SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 
GGCAGTCGAT GTCAGCACCG TAGGTGTCGT AGCAACCCGG GGACAGACCC TGGGTGTGAG SO 
CGGTGCAAGC GAAACGACGG TGGTAACCGT AGTC 94 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 

(iv) ANTI - SENSE : YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
TGGTGTAGTC GGATTCCGGA AC CAGGTAGG ACGGGTTAAC GGAAACTTTC AGGATGTAGT 
TACCCGGTTT AACGTCGGTG ATGTCGATCC ACT 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(iij MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic oligonucleotide" 

(iii) HYPOTHETICAL: YES 
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(iv) ANTI- SENSE : YES 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 30: 
GATCCTATCA TTAGTACGGG GAGATGGTGC AACCGGAAGC GTAAGCGTGG TGACCGGTGT 
AACGGATATC GCAACGAACC ACGTTGT 
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THE CLAIMS 

1. A synthetic polynucleotide encoding an amino 
acid sequence selected from the group consisting of lysyl 
oxidase, lysyl oxidase- like proteins and variants 

5 thereof . 

2. A synthetic polynucleotide according to claim 1 
wherein the synthetic polynucleotide comprises a 
nucleotide sequence which is derived from a nucleotide 
sequence expressed in a vertebrate species including a 

io mammalian or avian species. 

3 . A synthetic polynucleotide according to claim 2 
wherein the synthetic polynucleotide sequence comprises a 
nucleotide sequence which is derived from a nucleotide 
sequence expressed in humans . 

15 4. A synthetic polynucleotide according to any one 

of claims 1-3 wherein the synthetic polynucleotide has a 
CAI score of at least 0.30 in a particular host cell. 

5. A synthetic polynucleotide according to claim 4 
wherein the particular host cell is selected from the 
group consisting of E. coli, Saccharomyces cerevisxae and 
Pichia . 



20 



6. A synthetic polynucleotide comprising the 
nucleotide sequence shown in Figure 2 (SEQ ID NO: 3) or a 
fragment thereof, wherein the fragment thereof encodes an 

25 amino acid sequnece which retains the ability of lysyl 
oxidase or lysyl oxidase- like proteins to catalyse the 
oxidation of primary amines. 

7. A synthetic polynucleotide according to claim 6 
wherein the nucleotide sequence commences at the codon 

30 encoding amino acid residue 141. 
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8. A synthetic polynucleotide according to claim 6 
wherein the nucleotide sequence commences at the codon 
encoding amino acid residue 144. 



9. A synthetic polynucleotide according to any or 
of claims 6 to 8 wherein at least 5 0% of the nucleotide 
mutations indicated in Figure 2 (SEQ ID NO:3) have been 
introduced. 



10. A synthetic polynucleotide according to any . 
of claims 6 to 8 wherein the number of nucleotide 
mutations which have been introduced produce a CAI sc 
for the synthetic polynucleotide in E. coli of at lea 
0.30. 



11. A synthetic polynucleotide according to any one 
of claims 1-6 wherein at least 50% of codons for any 
particular amino acid are selected and altered to reflect 
preferred codon usage in the particular host. 

12. A synthetic polynucleotide according to any one 
of claims 1-6 wherein at least 5 0% of codons for amino 
acids for which altered codons are desirable are altered. 

13. A synthetic polynucleotide according to claim i; 
wherein at least 60% of codons for amino acids for which 
altered codons are desirable are altered. 



14. A synthetic polynucleotide according to claims 
12 wherein at least 70% of codons for amino acids for 
which altered codons are desirable are altered. 

15. A synthetic polynucleotide comprising an 
nucleotide sequence encoding a lysyl oxidase, lysyl 
oxidase like protein or variant thereof, wherein all or 
part of the 5' or 3 ' untranslated region, and/or intronic 
region of the synthetic polynucleotide is deleted. 
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16 . A synthetic polynucleotide sequence encoding a 
lysyl oxidase, lsysyl oxidase- like protein or variant 
thereof, wherein all or part of the region encoding the 
signal peptide, and/or region encoding the pro-peptide, 

5 of the polynucleotide is deleted. 

17. A synthetic polynucleotide sequence encoding a 
lysyl oxidase, lsysyl oxidase -like protein or variant 
thereof, wherein all or part of the region encoding the 
signal peptide, and/or region encoding the pro-peptide, 

io of the polynucleotide is deleted and wherein all or part 
of the 5' or 3 ' untranslated region, and/or intronic 
region of the synthetic polynucleotide is deleted. 

18. A synthetic polynucleotide comprising a 
nucleotide sequence encoding an amino acid sequence 

15 selected from the group consisting of lysyl oxidase, 

lysyl oxidase- like proteins and variants thereof and a 
further nucleotide sequence wherein the nucleotide 
sequence encoding lysyl oxidase, lysyl oxidase- like 
protein or a variant thereof and the further nucleotide 

20 sequence are chemically linked so as to encode a single 
open reading frame. 

19 . A synthetic polynucleotide according to claim 18 
wherein the further nucleotide sequence is chemically 
linked to the 5 1 end of the nucleotide sequence encoding 

25 lysyl oxidase, lysyl oxidase- like protein or a variant 
thereof . 

20. A synthetic polynucleotide according to claim 18 
or 14 wherein the further nucleotide sequence encodes all 
or part of a protein which causes the lysyl oxidase, 

30 lysyl oxidase- like protein or variant thereof to be 
expressed in any one of the following environments 
including the extracellular environment, the cell 
surface environment, the periplasmic space or the 
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intracellular environment, or a combination of any one of 
the environments . 

21. A synthetic polynucleotide according to claim 20 
wherein the further nucleotide sequence encodes 

5 glutathione S- transferase . 

22 . A recombinant DNA molecule comprising a 
synthetic polynucleotide of any one of claims 1 to 21. 

23 . A recombinant DNA molecule according to claim 
22, the recombinant DNA molecule further comprising 

10 vector DNA. 

24. A recombinant DNA molecule according to claim 23 
wherein the vector is selected from the group consisting 
of pBR322, pBluescript II, pGEX-2T, pTrc99A, pET series 
vectors, pET3d, pCNNinv, pYEX-BX and derivatives of these 

15 vectors. 

25. A host cell containing a recombinant DNA 
molecule according to any one of claims 22, 23 or 24 or a 
synthetic polynucleotide according to any one of claims 1 
to 21. 

20 2 6. A host cell according to claim 25 wherein the 

host cell is selected from the group consisting of 
prokaryotic cells and eukaryotic cells. 

27. A host cell according to claim 26 wherein the 
prokaryotic cells are bacteria. 

25 28. A host cell according to claim 25 wherein the 

host cell is selected from the group consisting of E. 
coli, Saccharomyces cerevisea and Pichea. 

29. An expression product encoded by a synthetic 
polynucleotide of any one of claims 1 to 21. 
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30. A method for producing an expression product 
according to claim 29, the method comprising the 
following steps: 

(i) providing a host cell according to any one of 
5 claims 25 to 28; 

(ii) maintaining the host cell in conditions 
suitable for expression of the synthetic polynucleotide; 
and 

(iii) isolating the expression product. 



31. A method of cross- linking molecules that contain 
primary amines, wherein the molecules are associated with 
the cell membrane or the extracellular environment, the 
method comprising contacting the primary amine group with 
an expression product according to claim 29, under 
conditions suitable to cross -link the molecules. 



32. A method of cross -linking molecules that contain 
primary amines in vitro, the method comprising the step 

of contacting the primary amines with an expression 
product according to claim 29 under in vitro conditions 
which are suitable to cross -link the molecules. 

33. A method of cross - linking tropoelastin, collagen 
or both the method comprising the step of contacting the 
primary amines of collagen and and/or tropoelastin with 
an expression product according to claim 29 under 
conditions which are suitable to cross -link the collagen 
and/or with tropoelastin.' 



34. A method of applying at least one cross -linkable 
primary amine containing molecule to a surface, wherein 
the surface provides primary amines available for cross - 
linking, the method comprising exposing the surface to 
the at least one primary amine containing molecule and 
an expression product according to claim 29 under 
conditions to permit the expression product to cause 
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cross linking of the molecules to each other and to the 
surface. 

35. The method according to any one claims 31 to 34 
wherein the molecule is a peptide or polypeptide and the 
primary amine is associated with an amino acid side 
chain. 

36. The method of claim 35 wherein the peptide or 
polypeptide comprises the amino acid sequence of 
tropoelastin or a fragment thereof. 

37. The method of claim 35 wherein the peptide or 
polypeptide comprises the amino acid sequence of collagen 
or a fragment thereof. 

38. The method according to any one of claims 31 to 
34 wherein the molecule is a non-peptide molecule. 

39. A method of accelerating the rate of wound 
healing, the method comprising the step of administering 
an expression product according to claim 29, to a wound 
under conditions which are suitable for cross -linking 
molecules which contain primary amines. 

40. A method of accelerating the rate of wound 
healing according to claim 39, the method further 
comprising the step of administering molecules that 
contain primary amines to the wound under conditions 
which are suitable for cross -linking the molecules 
containing the primary amine to the wound. 

41. The method according to claim 40 or 30 wherein 
the molecules that contain primary amines are 
tropoelastin molecules or derivatives thereof. 
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42. The method according to claim 39 or 40 wherein 
the molecules that contain primary amines are collagen 
molecules or derivatives thereof . 

43. The method according to claim 39 or 40 wherein 
the molecules that contain primary amines are collagen 
molecules or derivatives thereof, and tropoelastin 
molecules or derivatives thereof. 

44. A formulation for use in accelerating the rate 
of wound healing, the formulation comprising an 
expression product according to claim 29 and molecules 
containing primary amines, together with a 
pharmaceutically acceptable carrier, diluent or 
excipient . 

45. A formulation according to claim 44 wherein the 
molecules containing primary amines include at least one 
of tropoelastin or a derivative thereof and collagen or a 
derivative thereof, together with a pharmaceutically 
acceptable carrier, diluent or excipient. 

46. A kit for use in accelerating wound healing, the 
kit comprising a first vial containing an expression 
product according to claim 29 and a second vial 
containing molecules which contain primary amines. 

47. A kit according to claim 46 wherein the second 
vial contains tropoelastin or a derivative thereof. 

48. A kit according to claim 46 wherein the second 
vial contains collagen or a derivative thereof. 

49. A kit according to claim 47 where the kit 
additionally comprises a third vial containing collagen 
or a derivative thereof. 
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50. A kit according to claim 46 wherein the second 
vial contains collagen or a derivative thereof and 
tropoelastin or a derivative thereof. 

51. A method according to claim 39 or 40 wherein the 
expression product is administered to the wound by direct 
application of the product to the wound. 



52 . A method according to claim 39 or 40 wherein the 
expression product is administered to the wound by 
10 attaching the product to a matrix which is itself 
administered to the wound. 



53. A method according to claim 39 or 40 wherein 
the expression product is administered to the wound by 
application of a pharmaceutical formulation comprising an 
is expression product according to claim 29 and a 

pharmaceutically acceptable carrier, excipient or 
diluent . 



54. A method of deccelerating the rate of wound 
healing, the method comprising the step of administering 

20 antibodies which are raised against an expression product 
according to claim 29, to a wound under conditions which 
allow the antibodies to substantially inhibit the 
oxidation of primary amine moieties. 

55. A reagent or formulation thereof for use in 

25 modifying the rate of wound healing, wherein the reagent 
is generated by oxidising a primary amine contained in a 
chemical species with an expression product according to 
claim 29 so as to link the chemical species to a 
heterologous surface. 

30 56. A method of promoting fibrosis, the method 

comprising contacting an expression product according to 
claim 29 with a reagent which substantially enhances the 
specific acitivity of the expression product, under 
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conditions which are suitable for the oxidation of 
primary amine moieties. 

57. A method of promoting fibrosis, the method 
comprising administering an expression product according 

5 to claim 2 9 to a particular site under conditions 

suitable for the oxidation of primary amine moieties. 

58. A method of inhibiting fibrosis, the method 
comprising the step of administering antibodies which are 
raised against an expression product according to claim 

10 29, to a wound under conditions which allow the 

antibodies to substantially inhibit the oxidation of 
primary amine moieties. 

59 . A method for screening agents which 
substantially enhance or substantially inhibit the 

15 specific activity of lysyl oxidase, or lysyl oxidase -like 
proteins or variants thereof, the method comprising the 
step of contacting the agent with the expression product 
of the invention and measuring the specific activity of 
the expression product. 

20 60. A method for detecting a protein or non-protein 

substrate in a sample, the method comprising contacting 
the sample with an expression product according to claim 
29 and detecting the oxidation of any primary amine 
moieties in the sample. 

25 61. A method for quantifying the amount of a protein 

or non-protein substrate in a sample, wherein the number 
of primary amine moieties per substrate molecule is 
known, the method comprising contacting the sample with 
the expression product according to claim 29 and 

30 detecting the amount of oxidation of any primary amine 
moieties in the sample. 
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62 . A method of cancer therapy in a host in need of 
such treatment, comprising administering a synthetic 
polynucleotide according to any one of claims 1 to 21, or 
an expression product according to claim 29, alone or as 
part of a complex delivered extracellularly in the host. 

63. A method of regulating cell growth, the method 
comprising contacting a cell with an expression product 
according to claim 29 under conditions which permit the 
regulation of cell growth. 

64. A method of modifying a lysine or other primary 
amine residue, the method comprising oxidising the 
primary amine by exposure to an expression product 
according to claim 29. 

65. A tissue glue comprising and expression product 
according to claim 29 and a pharmaceutically acceptable 
carrier and/or diluent. 

66. A tissue glue according to claim 65 which 
additionally comprises at least one primary amine 
molecule species. 

67. An antibody raised to an expression product 
according to claim 29. 

68. The antibody of claim 67 which is specific to an 
expression product according to claim 29 . 

69. A method of cross- linking molecules that contain 
primary amines according to claim 35, wherein the method 
comprises providing a complex consisting cross -linked 
elastin and collagen molecules, and contacting the 
complex with an expression product according to claim 29. 

70. A method of accelerating the rate of wound 
healing, the method comprising the step of attaching the 
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expression product according to claim 29 to a matrix and 
applying the matrix to the wound. 

71. The method of claim 70 wherein the matrix 
comprises a complex of cross-linked elastin and collagen 
5 molecules. 
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1/1 31/11 

ATG CQC TTC GCC TOG ADC QTG CTC CTG CTC GGG CCT TTG CAG CTC TGC GCG CTA GTG CAC 

E — 8 — & H 1 X h h LGPLOLCAT, Vp 

61/21 91/3l 

TGCC^CCTCCCKX:<^GGCC^CAG(^CCCCC«CGCGM 

CAPPAAGQQQp PRE pp AAp 

151/51 

GCC TGG CGC CAG CAG ATC CAA TOG GAG AAC ARC GOG CAG GIG TIC AGC TOG CTG AGC CTG 
181/61 211/71 

GGC TCA CAG TAC CAG CCT CAG CGC CGC CGG GAC CCG GGC GCC GCC GTC CCT GOT GCA GCC 

241/81 271/91 
AMGCCTCCGCCCAGCAGCCCCGCACTCCGATCCTC 

SAQQPRTPII «I«IKl>H R TA 
301/101 331/111 

GOG GCG CGA ACG CGG ACG GCC GGC TCA TCT GGA GTC ACC GCT GGC CGC CCC AGG CCC ACC 

RTRTAGSS QVTAGRP RPT 
361/121 391/131 

GOC CGT CAC TGG TTC CAA GCT GGC TAC TCG ACA TCT AGA GCC CGC GAA CGT GGC GCC TOG 

«1 £i HWFQA6Y ST6RARERGAS 
421/141 451/151 

CGC GCG GAG AAC CAG ACA GCG CCG GGA GAA GTT CCT GCG CTC AGT AAC CTG CGG CCG CCC 

RAENQTAPGEVPALSNLRPP 
481/161 511/ln 

AGC CGC GTG GAC GGC ATG GTG GGC GAC GAC CCT TAC AAC CCC TAC AAG TAC TCT GAC GAC " 

SRVDGHVGDDPTHPYKYBDD 
541/1B1 571/191 

AAC CCT IRT TAC AAC TAC TAC GAT ACT TAT GAA AGG COC AGA OCT GGG GGC AGG TAC CGG 

L « M YYNYYDT YERPRPGGRYR 
601/2 °1 631/211 

CCC GGA TAC GGC ACT GGC TAC TTC GAG TAC GGT CTC OCA GAC CTG GTG GCC GAC CCC TAC 

L/?„ Y0T0 **Q*GLPDLVADPY 
661/221 691/231 

TAC ATC CAG GOG TOG ACG TAC GTG CAG AAG ATG TCC ATG TAC AAC CTG AGA TGC GCG GCG 

791/L, QABTYV Q* MB »*NLRCAA 
721/241 751/251 

GAG GAA AAC TGT CTG GCC AGT ACA GCA TRC AGG GCA GAT GTC AGA GAT TAT GAT CAC AGG 
781/261 811/271 

GTG CTG CTC AGA VHZ CCC CAA AGA GTG AAA AAC CAA GGG ACA TCA GAT CTC TEA CCC AGC 

Ml/Ml LRFPQRVK «QO»SDFI.PS 
841/281 871/291 

OGA CCA AGA TAT TCC TOG GAA TGG CAC AGT TGT CAT CAA CAT TAC CAC ACT ATG GAT GAG 

oni /mi RY8WEWH SCHQ H YH S MDE 
901/301 931/311 

TTT AGC CAC TAT GAC CTG CTT GAT GCC AAC ACC CAG AGG AGA GTG GCT GAA GGC CAC AAA 
FSHYDLtiDAHTQRRVAEGHK 



Fig. 1(1) 
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961/321 991/331 

GCA AGT TTC TOT CTT GAA GAC RCA. TCC TGT GAC TAT GGC TAG CAC AGG OQA TTT OCA TGT 

ASFCLEDTSCDYGYHRRFAC 
1021/341 1051/351 

ACT GCA CAC ACA CAG GGA TTG AGT CCT GGC TGT TAT GAT ACC TAT GGT GCA GAC ATA GAC 

TAHTQGL6PGCYDTYGADID 
1081/361 1111/371 

TGC CAG TOG ATT GAT ATT ACA GAT GTA AAA. CCT GGA AAC TAT ATC CTA AAG GTC AGT GTA 
CQWIDITDVKPGNYILKVSV 
1141/381 1171/391 

AAC CCC AGG TAG CTG GTT OCT GAA TCT GAC TAT ACC AAC AAT GTT GTG CGC TGT GAC ATT 
NPSYLVPE6DYTNNVVRCDI 
1201/401 1231/411 

CGC TAC ACA GGA CAT CAT QCG TAT GCC TGA GGC TGC ACA ATT TCA COG TAT 
RYTGHHAYASGCTISPY 



Fig. K2) 
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BamRl 

2/ 31/11 

GA TCC CAG CAG CAG CCG CCG CGT GAR CCG CCG GCT OCT CCG GGT GCT TGG CGT CAG CAG 

SQQQPPREPPAAPGAWRQQ 
61/21 9l/31 

ATC CAG TGG GAA AAC RAC GGT CAG GTT TTC TCC CTG CTC TCC CTC GGT TCC CAG TAC CAG 

IQWENHGQVFBLL6LG6QYQ 
"1/41 151/51 

CCG CAG OGT CGT CGT GAC CCG GGT GCT GCT GTT CCG GGT GCT GCT AAC GCT TCC GCT CAG 

PQRRRDPGAAVPGAAHAGAQ 
181/61 211/71 

CAG CCG CGT ACC CCG ATC CTG CTG ATC CGT GAC AAC CGT ACC GCG GCC GCT CGT ACC CGT 
QPRT PILLIRDHRTAAARTR 



Pstl 

241/81 271/91 

ACC GCT GGT TCC TCC GGT GTT ACT GCA GGT CGT CCG CGT CCG ACC GCG CGC CAC TGG TCC 

TAG S6GVTAGRPRPTARHWF 
301/101 33i /nl 

CAG GCT GGT TAC TCC ACC TCC CGT GCT CGT GAA GCT GGT GCT TCC CGT GCT GAA AAC CAG 

Q AGY GTSRARE AGASRAENQ 
361/121 39l/13X 

ACC GCT CCG GGT GAA GTT CCA GCG CTG TCC AAC CTG CGT CCG CCG TCC CGT GTT GAC GGT 

TAPGEVPALSNLRPPSRVDG 
421/141 45l/151 

ATC GTT GOT GAC GAC CCG TAC AAC CCG TAC AAG TAC TCC GAC GAC AAC CCG TAC TAC AAC 
MVGDDPYNPYKYSDDNPYYN 



Kpnl 

481/161 5ll/171 

TAC TAC GAC AOC TAC GAG CGC CCG COT CCG GGT GOT OGT TAC CGT CCG GOT TAC GOT ACC 

YYDTYERPRPQGRYRPGYGT 
541/181 571/191 

GGT TRC TTC CAG TAC GOT CTG COG GAC CTG GTT OCT GAC CCG TAC TRC ATC CAG GCT TCC 
GY FQYGI.PDLVADPYYIQAS 

Fig. 2(1) 
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601/201 631/211 

ACC TAC GTT CAG AAA ATG TCC ATG TAC AAC CTG OGT THC GCT GCT GAA GAA ARC TGC CTC 

TYVQKMSMYHLRCAAEEHCL 
661/221 691/231 

OCT TCC ACC GCT TAC CGT GCT GAC GTT CGT GAC TAC GAC CAC CGT GTT CTC CTG CGT TTC 

AE *TAYRADVBDYDHRVLLRF 
721/241 751/251 

CCG CAG CGT GOT AAA AAC CAG GGC ACC TCC GAC TTC CTG CCG TCC CGT COG CGT TAC TCC 
PQRVKNQGTSDFLPSRPRYS 



EcdRl 

781/261 811/271 

TGG GAA TGG CAC TCC TGC CAC CAG CAC TAC CAC TCC ATG GAC GAA TTC TOC CAC TAC GAC 

WEWHSCHQHYHSMDEFSHYD 
841/281 871/291 

CTG CIS GAC GCT AAC ACC CAG CGT OCT GTT GCT GAA GGT CAC AAA GCT TCC TTC TGC CTG 
LLDAHTQRRVAEGHKASFCL 
901/301 931/311 

GAA GAC ACC TCC TGC GAC TAC GGT TAC CAC CGT CGT TTC GCT TGC ACC GCT CAC ACC CAG 

EDTSCDYGYHRRFACTABTQ 
961/321 991/331 

GGT CTG TCC CCG GGT TGC TAC GAC ACC TAC GGT GCT GAC ATC GAC TGC CAG TGG ATC GAC 
GLSPGCYDTYGADIDCQWID 
"21/341 1051/351 

ATC ACC GAC GTT AAA CCG GGT AAC TAC ATC CTG AAA GTT TCC GTT AAC COG TCC TAC CTG 
ITDVKPGHYIIiKVSVNPSYL 
1081/361 1111/371 

GTT COG GAA TOC GAC TAC AOC AAC AAC GIG GTT CGT TGC GAT ATC CGT TAC ACC GGT CAC 
VPESDYTNHVVRCDIRYTGH 



BamHI 

1141/381 1171/391 

CAC GCT TAC GCT TCC GGT TGC ACC ATC TCC CCG TAC TAA TGA TAG 

HAYASGCTISPY*** 

Fig. 2(2) 
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Bl CCAACAGCAGCCCCCGCGCGAGCCGCCGGCGGCTCCGGGCGCCTGGCGCC 130 

in iiiiiiii inn it mum mmii n urn i 

6 CCAGCAGCAGCCGCCGCGTGAACCGCCGGCTGCTCCGGGTGCTTGGCGTC 55 

• • * • • 

131 AGCAGATCCAATGGGAGAACAACGGGCAGGTGTTCAGCTTGCTGAGCCTG 180 

iiiiiimi urn mum inn m i inn mi 

56 AGCAGATCCAGTGGGAAAACAACGGTCAGGTTTTCTCCCTGCTGTCCCTG 105 

• * • • • 

181 GGCTCACAGTACCAGCCTCAGCGCCGCCGGGACCCGGGCGCCGCCGTCCC 230 

ii it imiimii inn ii n iiiiiiii ii ii ii n 

106 GGTTCCCAGTACCAGCCGCAGCGTCGTCGTGACCCGGGTGCTGCTGTTCC 155 

■ * • ■ • 

231 TGGTGCAGCCAACGCCTCCGCCCAGCAGCCCCGCACTCCGATCCTGCTGA 280 

mil ii inn urn iiiiiiii ii ii imimimi 

156 GGGTGCTGCTAACGCTTCCGCTCAGCAGCCGCGTACCCCGATCCTGCTGA 205 

• • • • • 

281 TCCGCGACAACCGCACCGCCGCGGCGCGAACGCGGACGGCCGGCTCATCT 330 

Mil IIIIIIII Hill II II II II II II II II II II 
206 TCCGTGACAACCGTACCGCGGCCGCTCGTACCCGTACCGCTGGTTCCTCC 255 

■ • • • • 

331 GGAGTCACCGCTGGGCGCCCCAGGCCCACCGCCCGTCACTGGTTCCAAGC 380 

II II IIIIIIII II II I II Mill II I I I I I I I 1 I t 1 II 

256 GGTGTTACCGCTGGTCGTCCGCGTCCGACCGCGCGCCACTGGTTCCAGGC 305 

• • • • * 

381 TGGCTACTCGAC&TCTAfiAGCCCGCGAACGTGGCGCCTCGCGCGCGGAGA 430 

III Mill II II I II II III III II II II II II I 

306 TGGTTACTCCACCTCCCGTGCTCGTGAAGCTGGTGCTTCCCGTGCTGAAA 355 

• ■ a * • 

431 ACCAGACAGCGCCGGGAGAAGTTCCTGCGCTCAGTAACCWGCGGCCGCCC 480 

limit ii urn iiiiiiii urn iiiiiiii iiiii 

356 ACCAGACCGCTCCGGGTGAAGTTCCAGCGCTGTCCRACCTGCGTCCGCCG 405 

< • • • • 

481 AGCCGCGTGGACGGCRTGGTGGGCGACGACCCraACRACCCCTACAA.GTA 530 

III II lllll IIIII II iiiiiiii iiiiiiii iiiiiiii 

406 TCCCGTGTTGACGGTATGGTTGGTGACGACCCGTACAACCCGTACAAGTA 455 

531 CTCTGACGACAACCCTTATTACAACTACTACGATACTTATGAAAGGCCCA 580 

Ml 11111111111 II lllllllllillll II II 11 I II 
456 CTCCGACGACAACCCGTACTACAACTACTACGACACCTACGAGCGCCCGC 505 

• • • • • 

581 GAG C TGGGGGCAGGT AC CGGC CC GG AT ACGGCACTGGC T ACTTCC AGTAC 630 

i ii ii ii i inn ii ii mn ii ii 1 1 1 1 1 1 1 1 1 1 1 1 

506 GTCCGGGTGGTCGTTACCGTCCGGGTTACGGTACCGGTTACTTCCAGTAC 555 

• • • • a 

631 GGTCTCCGRGACCTGGTGGCCGACCCCTACTACATCCAGGCGTCCACGTA 680 

HIM il IIIIIIII II lllll lllllllllillll lllll II 
556 GGTCTGCCGGACCTGGTTGCTGACCCGTACTACATCCAGGCTTCCACCTA 605 
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681 CGTGCAGAAGATGTCCATGTACAACCTGAGATGCGCGGCGGAGGAAAACT 730 

in mil 1 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 i him ii i! mini 

606 CGTTCAGAAAATGTCCATGTACAACCTGCGTTGCGCTGCTGAAGAAAACT 655 

• • • • • 

731 GTCTGGCCAGTACAGCATAC^GOT 780 

I Hill II II III I II II II I II il II III I 

656 GCCTGGCTOCCACCGCTrACCGTGCTGACGTTCGTGACTACGACCACCGT 705 

• « • • . 

7B1 GTGCTCCTCAGATTTCCCCAAAGAGTGAAAAACCaUVGGGACATCAGATTT 830 

II Hill I II II II I II lllllill II II II II II 

706 GTTCTGCTGCGTTTCCCGCAGCGTGTTAAAAACCAGGGTACCTCCGACTT 755 

• • • • • 

831 CTTACCCAGCCGACCAAGATAOTCCTGGGAATGGCACAGTTGTCATCAAC 880 

I I II HI M I II lllllllllllllll II II II I 
756 CCTGCCGTCCCGTCCGCGTTACTCCTGGGAATGGCACTCCTGCCACCAGC 805 

• • • • • 

881 ATTACCACAGTATGGATGAGTTTAGCCACTATGACCTGCTTGATGCCAAC 930 

i mill urn ii ii iiiiii mum n u m 

806 ACTACCACTCCATGGACGAATTCTCCCACTACGACCTGCTGGACGCTAAC 855 

• • • « • 

931 ACCCAGAGGAGAGTGGCTGAAGGCCACAAAGCAAGTTTCTGTCTTGAAGA 980 

mm i i ii iiimii mum inn n urn 

856 ACCCAGCGTCGTGTTGCTGAAGGTCACAAAGCTTCCTTCTGCCTGGAAGA 905 

• * • • • 

981 CACATCCTGTGACTATGGCTACCACAGGCGATTTGCATGTACTGCACJACA 1030 

m mil urn ii mm i n n n n u n mi 

906 CACCTCCTGCGACTACGGTTACCACCGTCGTTTCGCTTGCACCGCTCACA 955 

• * • * m 

1031 CACAGGGATTGAGTCCTGGCTGTTATGATACCTATGGTGCAGACATAGAC 1080 

i urn ii ii ii ii ii ii urn mil urn in 

956 CCCAGGGTCTGTCCCCGGGTTGCTACGACACCTACGGTGCTGRCATCGAC 1005 

• • • • ■ 

1081 TGCCAGTGGATTGATATTACAGATGTAAAACCTGGAAACTATATCCTAAA 1130 

nullum u u u u u mil u mil mil u 

1006 TGCCAGTGGATCGACATCACCGACGTTAAACCGGGTAACTACATCCTGAA 1055 

• • • • • 

1131 GGTCAGTGTAWiCCCCAGCTACCTGGTTCCTGAATCTGftGTAT^ 1180 

ii ii mu 1 1 1 1 1 1 1 1 1 1 1 1 urn inn iiiiiii 

1056 AGTTTCCGTTAACCCGTCCTACCTGGTTCCGGAATCCGACTACACCAACA 1105 

• • • m m 

1181 ATCTTGTGCGOTOTGJ«:ATTCGCTAC»CAGGACATCAT^ 1230 

i mu ii u u ii ii urn ii ii u ii ii u ii 

1106 ACGTTGTTCGTTGCGATATCCGTTACACCGGTCAtXIACGCTTACGCTTCC 1155 
1231 GGCTGCACAATTTCACCGTAT 1251 

it mu ii ii urn 

1156 GGTTGCACCATCTCCCCGTAC 1176 
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BLOCK It Coding afcra nri 
OjLigo_l 

GATCC<^GCAGC^GCCGCCGCGTGAACCGCCGGCTGCTCCGGGTGCTTGGCGTCAGC^GA^ 
TCCAGTGGGAAAACAACGGTCAGGTTTTCTCCIC^ ( S EQ ID NO: 5) 

pCTGTCCCTGGGTTCCCAGTACCAGCCGCAGCGTCGTCGTGACCCGGGTG^ 
GGGTGCTGCTAACGCTTCCGGTCAGG (SEQ ID NO: 6) 

OliOQ 3 

pAGCCGCGTACCCCGATCCTGCTGATCCGTGACAACCGTACCGCGGCCGCTCGTACCCGT-- 
ACCGCTGGTTCCTCCGGTGTTACTGCA (SEQ ID NQ . ?) 

BLOCK 2 i Coding Btrand 
Oliao 4 

GGTCGTCCGCGTCCGACCGCGCGCCACTGGTTCCAGGCTGGTTACTCCACCTCCCGTGCT- 
CGTGAAGCTGGTGCTTCCCGTGCTGAAAACCAG (SEQ ID NO: 8) 

Oliao 5 

pACCGCTCCGGGTGAAGTTCCAGCGCTGTCCAACCTGCGTCCGCCGTCCCGTGTTGACGG- 
TATGGTTGGTGACGACCCGTACAACCCGTACA (SEQ ID NO: 9) 

Oliao 6 

pAGTACTCCGAGGACAACCCGTACTACAACTACTACGACACCTACGAGCGCCCGCGTCCG- 
GGTGGTCGTTACCGTCCGGGTTACGGTAC (SEQ ID NO: 10) 

BLOCK 3» Coding strand 

CGGTTACTTCGAGTACGGTCTGCCGGACCTGGTTGCTGACCCGTACTAC^ 
CACCTACGTTCAGAAAATGTCCATGTAC ( SEQ ID NO: 11) 

pAACCTGCGTTGCGCTGCTGAAGAAAACTGCCTGGCTTCCACCGCTTACCGT^ 
TCGTGACTACX5ACCACCGTGTTCIX3CTGCGOT (SEQ ID NO: 12) 
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O Uqo 9 

pAGCGTGTTAAAAACCAGGGCACCTCCGACTTCCTGCCGTCCCGTCCGCGTO 
GAATGG<^CTCCTGCCRCCAGC^TACC^CTC^ (SEQ ID NO: 13) 

BLOCK 4; Coding atrand 
Oligo 10 

AATTCTCCCACTACGACCTGCTGGACGCTAACACCCAGCGTCGTGTTGCTGAAGGTCACA- 
AAGCTTCCTTCTGCCTGGAAGACACCTCCTGCGACTACG (SEQ ID NO: 14) 

Oliao 11 

pGTTACCACCGTCGTTTCGCTTGKJACCGCTCACACCCAGGGTCTGTCCC^ 
GACACCTACGGTGCTGACATCGACTGCCAGTGGA (SEQ ID NO: 15) 

QUqo 12 

PTCGACATCACCGACGTTAAACCGGX3TAACTACATCCTGAAAGTTTCCGTTAACCCGTCC- 
TACCTGGTTCCGGAATCCGACTACACCAACAACG (SEQ ID NO: 16) 

Olioro 13 

pTGGTTCGTTGCGATATCCGTTACACCGGTCACCACGCOT 

TCCCCGTACTAATGATAG (SEQ ID NO: 17) 

COMPLEMENTARY OLIGONUCLEOTIDES 
BLOCK 1: Complementary strand 
Complementary to oliao 1 
pGAAAACCTGACCXSTTGTTTTCCCACTGGATC 

GCGGTTCACGCGGCGGCTGCTGCTGG (SEQ ID NO: 18) 

Complementary to oliao 9. 

pCGGAAGCGTTAGCAGCACCCGGAACAGCAGCACCCG^TCACGAC 

TACTGGGAACCCAGGGACAGCAGGGA (SEQ ID NO: 19) 

Fig. 4(2) 
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Complementary to olioo 3 

GTAACACCXK3AGGAACCAGCGGTACGGGTACGAGCGGCCGCGGTAGGGTTGTCACGGATC- 
AGCAGGATCGGGGTACGCGGCTGCTGAfi ( SEQ ID NO: 20) 



BLOCK 2 i Complementary strand 
Complementary to olioo 4 
pTTCAGCACGGG^GCACC^GCTTCAC 

AGTGGCGCGCGGTCGGACGCGGACGACCTGCA ( SEQ ID NO: 21) 

Complementary to olioo 5 
pGTTGTACGGGTCGTCACCAACCATACCGTCAACL^ 

GCGCTGGAACTTCACCCGGAGCGGTCTGGTT (SEQ ID NO: 22) 



Complementary to olioo 6 

CGTAACCCGGACGGTAACGACCACCCGGACGCGGGCGCTCGTAGGTGTCGTAGTAGTTGT- 
AGTACGGGTTGTCGTCGGAGTACTTGTACGG (SEQ ID N0 . 23) 

BLOCK 3t Complementary strand 
Complementary to olioo 7 
pGCAGGTTGTAC^TGGACATTTTCTGAACGTAGGTGGAA 

GCAACCAGGTCCGGCAGACCGTACTGGAAGTACCGGTAC (SEQ ID NO: 2 4) 

Complem entary to olioo 8 

pACACGCTGCGGGAAACGCAGCAGAACACGGTGGTCGTAGTC^ 

AGCGGTGGAAGCXIAGGCAGTTTTCTTCAGC^GCGCAAC (SEQ ID N0: 2 5 ) 

Complementary to olioo 9 
AATTCGTCCATGGAGTGGTAGTGCTGGTGGCAG 

CGGGACGGCAGGAAGTCGGAGGTGCCCTGGTTTTTA ( SEQ ID NO: 2 6) 
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BLOCK 4 ; Complementary strand 
Complementary to oliao 10 
pGCAGGAGGTGTCTTCGAGGCA^ 

TGTTAGCGTCCAGCAGGTCGTAGTGGGAG (SEQ ID NO: 27) 

Complementary to oliao 11 

pGGCAGTCGATGTCAGCACCGTAGGTGTCGTAGCAACCCGGGGACAGACCCTGGGTGTGA- 
GCGGTGCAAGGGAAAGGACGGTGGTAACCGTAGTC ( SEQ ID NO: 28) 

Complementary to oliao 12 
pTGGTGTAGTCGGATTCCGGAACCAGGTAGGACGGGT^ 

TTACCCGGTTTAACGTCGGTGATGTCGATCCACT (SEQ ID NO: 29) 

Complementary to olioo 13 
GATCCTATCATTAGTACGKSGGAGATG^ 

AACGGATATCGCAACGRACCACGTTGT (SEQ ID NO: 30) 
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